ATTORNEY DOCKET NO. 07891/003001



Applicant or Patentee: Robert G. Korneluk et al. 

Serial or Patent No.: 08/576,956 

Filed or Issued: December 22, 1995 

For: MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND DETECTION METHODS 



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS
(37 CFR 1.9(f) and 1.27(d)) - NONPROFIT ORGANIZATION 



I hereby declare that I am an official empowered to act on behalf of the nonprofit organization identified below: 
Name of Organization: University of Ottawa 

Address of Organization: 550 Cumberland, Ottawa, Ontario, Canada K1N 6N5
Type of Organization: 



[X] UNIVERSITY OR OTHER INSTITUTION OF HIGHER EDUCATION 

[ ] TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 USC 501(a) and 501(c)(3))

[ ] NONPROFIT SCIENTIFIC OR EDUCATIONAL UNDER STATUTE OF STATE OF THE UNITED STATES OF AMERICA

(NAME OF STATE: )

(CITATION OF STATUTE: )

[ ] WOULD QUALIFY AS TAX EXEMPT UNDER INTERNAL REVENUE SERVICE CODE (26 USC 501(a) and 501(c)(3)) IF LOCATED IN THE UNITED STATES OF AMERICA

[ ] WOULD QUALIFY AS NONPROFIT SCIENTIFIC OR EDUCATIONAL UNDER STATUTE OF STATE OF THE UNITED STATES OF AMERICA IF LOCATED IN THE UNITED STATES OF AMERICA

(NAME OF STATE: )

(CITATION OF STATUTE: )



I hereby declare that the nonprofit organization identified above qualifies as a nonprofit organization as defined in 37 CFR
1.9(e) for purposes of paying reduced fees under section 41(a) and (b) of Title 35, United States Code with regard to the
invention entitled MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND DETECTION METHODS by inventor(s) Robert G. Korneluk,
Alexander R. MacKenzie, and Stephen Baird described in



[ ] the specification filed herewith.

[X] application serial no. 08/567,959, filed December 22, 1995.

[ ] patent no. , issued .

I hereby declare that rights under contract or law have been conveyed to and remain with the nonprofit organization with
regard to the above identified invention.

If the rights held by the nonprofit organization are not exclusive, each individual, concern or organization having rights to
the invention is listed below* and no rights to the invention are held by any person, other than the inventor, who could not
qualify as a small business concern under 37 CFR 1.9(c) or by any concern which would not qualify as a small business concern
under 37 CFR 1.9(d) or a nonprofit organization under 37 CFR 1.9(e).

*NOTE: Separate verified statements are required from each named person, concern or organization having
rights to the invention averring to their status as small entities. (37 CFR 1.27)

Name: Apoptogen, Inc.

Address: 100 International Blvd., Etobicoke, Ontario, Canada M9W 6J6



[ ] INDIVIDUAL [X] SMALL BUSINESS CONCERN [ ] NONPROFIT ORGANIZATION



I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of 
entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any 
maintenance fee due after the date on which status as a small entity is no longer appropriate. (37 CFR 1.28(b)) 



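I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements
and the like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States
Code, and that such willful false statements may jeopardize the validity of the application, any patent issuing thereon, or
any patent to which this verified statement is directed.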



Name: Jean Farrall

Title: Director, Research Services

Address: University of Ottawa, 115 Seraphin Marion, Ottawa, Canada

Signature:

Date:




ATTORNEY DOCKET NO. 07891/003001 



Applicant or Patentee: Robert G. Korneluk et al.
Serial or Patent No.: 08/576,956 
Filed or Issued: December 22, 1995 

For: MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND DETECTION METHODS

VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS
(37 CFR 1.9(f) and 1.27(c)) - SMALL BUSINESS CONCERN

I hereby declare that I am 

[X] the owner of the small business concern identified below: 

[ ] an official of the small business concern empowered to act on behalf of the concern identified
below: 

Name of Small Business Concern: Apoptogen, Inc.

Address of Small Business Concern: 100 International Blvd., Etobicoke, Ontario, Canada M9W 6J6

I hereby declare that the above identified small business concern qualifies as a small business concern as defined in 13 CFR 
121.12, and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees to the United States Patent and Trademark 
Office, in that the number of employees of the concern, including those of its affiliates, does not exceed 500 persons. For 
purposes of this statement, (1) the number of employees of the business concern is the average over the previous fiscal year 
of the concern of the persons employed on a full-time, part-time or temporary basis during each of the pay periods of the 
fiscal year, and (2) concerns are affiliates of each other when either, directly or indirectly, one concern controls or has
the power to control the other, or a third party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and remain with the small business concern
identified above with regard to the invention, entitled MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND DETECTION METHODS by
inventor(s) Robert G. Korneluk, Alexander R. MacKenzie, and Stephen Baird described in

[ ] the specification filed herewith.
[X] application serial no. 08/576,956, filed December 22, 1995.

[ ] patent no. , issued .

If the rights held by the above identified small business concern are not exclusive, each individual, concern or organization
having rights to the invention is listed below and no rights to the invention are held by any person, other than the
inventor, who would not qualify as an independent inventor under 37 CFR 1.9(c) if that person made the invention, or by any
concern which would not qualify as a small business concern under 37 CFR 1.9(d), or a nonprofit organization under 37 CFR
1.9(e). NOTE: Separate verified statements are required from each named person, concern or organization having rights to
the invention averring to their status as small entities. (37 CFR 1.27)

Name: University of Ottawa

Address: 550 Cumberland, Ottawa, Ontario, Canada K1N 6N5

[ ] INDIVIDUAL [ ] SMALL BUSINESS CONCERN [X] NONPROFIT ORGANIZATION

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of
entitlement to small entity status prior to paying, or at the time of paying, the earliest of the issue fee or any 
maintenance fee due after the date on which status as a small entity is no longer appropriate. (37 CFR 1.28(b)) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements 
and the like so made are punishable by fine or imprisonment, or both, under section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application, any patent issuing thereon, or 
any patent to which this verified statement is directed.

Name: Frank Gleeson

Title: President and CEO

Address: Etobicoke, Ontario, Canada M9W 6J6

Signature:

Date:

ATTORNEY DOCKET NO. 07891/003005



Certificate of Mailing 

Date of Deposit: September 1, 2000    Label Number: EL509049835US

I hereby certify under 37 CFR 1.10 that this correspondence is being deposited with the United States Postal Service as
"Express Mail Post Office to Addressee" with sufficient postage on the date indicated above and is addressed to BOX
PATENT APPLICATION, Director for Patents, Washington, D.C. 20231.

Printed name of person mailing correspondence    Signature of person mailing correspondence



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Robert G. Korneluk et al.    Art Unit: Not Yet Assigned

Serial No.: Not Yet Assigned Examiner: Not Yet Assigned 

Filed: September 1, 2000 

Title: MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND 

DETECTION METHODS 

Director for Patents 
Washington, D.C. 20231 

PRELIMINARY AMENDMENT 
Prior to examination of the above-referenced application, kindly consider the 

following amendments and remarks. 

Please amend the application as follows: 
In the specification:

At page 1, line 5, before "The invention relates to apoptosis.", add the following:
-- Cross Reference To Related Applications

This application is a continuation of U.S.S.N. 08/576,956, filed December 22,
1995, which is a continuation-in-part of U.S.S.N. 08/511,485, filed August 4, 1995, now
issued as U.S. Patent No. 5,919,912. --.

At page 6, line 27, replace "IAP disease resistance gene" with --IAP gene--. 

At page 18, line 15, replace "Fig. 10 is a Northern blot" with -- Figs. 10A-C are a
series of Northern blots --.

At page 18, line 17, replace "Fig. 11 is a Northern blot" with -- Figs. 11A-C are a
series of Northern blots --.

At page 18, line 19, replace "Fig. 12 is a Northern blot" with -- Figs. 12A-C are a
series of Northern blots --.

At page 19, line 1, after "Tables 1 and 2" insert -- (SEQ ID NOS: 45-92) --.

At page 24, line 23, after "MEQKLISEEDL," insert -- (SEQ ID NO: 43) --.

At page 26, line 23, replace "Embo," with -- EMBO --.

At page 27, line 3, replace "Neurobiol," with -- Neurobiol. --.

At page 27, line 27, replace "Virol," with -- Virol. --.

At page 34, line 18, replace "Cell," with -- Cell --.

At page 34, line 18, replace "Nature." with -- Nature --.

At page 36, line 8, replace "Med," with -- Med. --.

Kindly remove the sequence listing found at pages 51-88 and renumber the pages
of the claims and abstract consecutively thereafter. The enclosed amended sequence
listing should be inserted at the end of the application.

In the Claims:

Cancel claims 2, 15-29, and 33-47. 

Amend claims 1, 3-7, 13, 14, and 30-32 as follows. 

1. (Amended) A substantially [Substantially] pure nucleic acid encoding [an IAP]
a mammalian inhibitor of apoptosis protein (IAP) polypeptide, wherein said inhibitor of
apoptosis protein is a protein that modulates apoptosis and comprises a ring zinc finger
(RZF) domain and at least one baculovirus inhibitor of apoptosis repeat (BIR) domain.

3. (Amended) The nucleic acid of claim [2] 1, wherein said polypeptide has at
least two [BIR] baculovirus inhibitor of apoptosis repeat (BIR) domains.

4. (Amended) The nucleic acid of claim 3, wherein said polypeptide has at least 
three [BIR] baculovirus inhibitor of apoptosis repeat (BIR) domains. 

5. (Amended) The nucleic acid of claim 1, wherein said [DNA] nucleic acid
contains the [xiap] X-linked inhibitor of apoptosis protein (xiap) gene.

6. (Amended) The nucleic acid of claim 1, wherein said [DNA] nucleic acid
contains the [hiap2] human inhibitor of apoptosis protein 2 (hiap2) gene.

7. (Amended) The nucleic acid of claim 1, wherein said [DNA] nucleic acid
contains the [hiap1] human inhibitor of apoptosis protein 1 (hiap1) gene.

DNA. 

13. (Amended) A [Substantially] substantially pure [DNA] nucleic acid having
the sequence of Fig. 5 (SEQ ID NO: 39), or degenerate variants thereof, and encoding the
amino acid sequence of Fig. 5 (SEQ ID NO: 40).

14. (Amended) A [Substantially] substantially pure [DNA] nucleic acid having the
sequence of Fig. 6 (SEQ ID NO: 41), or degenerate variants thereof, and encoding the
amino acid sequence of Fig. 6 (SEQ ID NO: 42).

30. (Amended) A method of producing [an IAP] a mammalian inhibitor of
apoptosis protein (IAP) polypeptide comprising:

providing a cell transformed with [DNA] nucleic acid encoding [an] a mammalian
IAP polypeptide positioned for expression in said cell, said polypeptide comprising a ring
zinc finger (RZF) domain;

culturing said transformed cell under conditions for expressing said [DNA] nucleic
acid; and

[isolating] producing said IAP polypeptide. 

31. (Amended) The method of claim 30, wherein said mammalian inhibitor of
apoptosis (IAP) [IAP] polypeptide is murine human inhibitor of apoptosis protein 1 (m-
HIAP1) [HIAP1].

32. (Amended) The method of claim 30, wherein said mammalian inhibitor of
apoptosis (IAP) [IAP] polypeptide is murine human inhibitor of apoptosis protein 2 (m-
HIAP2) [HIAP2].

Add the following new claims 48-78. 

-- 48. A substantially pure nucleic acid that hybridizes to a probe of at least 40
nucleotides in length, said probe derived from the nucleic acid sequence of Fig. 5 (SEQ
ID NO: 39), wherein said nucleic acid hybridizes to said probe under low stringency
conditions, said conditions comprising washing with 2X SSC at 40°C, and wherein said
nucleic acid encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said
polypeptide comprising a ring zinc finger (RZF) domain and at least one baculovirus
inhibitor of apoptosis repeat (BIR) domain.



49. A substantially pure nucleic acid that hybridizes to a probe of at least 40 
nucleotides in length, said probe derived from the nucleic acid sequence of Fig. 6 (SEQ 
ID NO: 41), wherein said nucleic acid hybridizes to said probe under low stringency 
conditions, said conditions comprising washing with 2X SSC at 40°C, and wherein said 
nucleic acid encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said 
polypeptide comprising a ring zinc finger (RZF) domain and at least one baculovirus 
inhibitor of apoptosis repeat (BIR) domain.

50. A substantially pure nucleic acid encoding a baculovirus inhibitor of apoptosis
repeat (BIR) domain, said nucleic acid comprising a sequence selected from the group
consisting of SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 49, SEQ
ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID
NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID
NO: 63, SEQ ID NO: 65, SEQ ID NO: 66, and SEQ ID NO: 67.

51. A substantially pure nucleic acid encoding a ring zinc finger (RZF) domain,
said nucleic acid comprising a sequence selected from the group consisting of SEQ ID
NO: 48, SEQ ID NO: 52, SEQ ID NO: 56, SEQ ID NO: 60, SEQ ID NO: 64, and SEQ ID
NO: 68.



52. The nucleic acid of claim 1, wherein said nucleic acid encodes an X-linked 
inhibitor of apoptosis protein (XIAP). 

53. The nucleic acid of claim 52, wherein said X-linked inhibitor of apoptosis 
protein (XIAP) is from a mouse. 

54. The nucleic acid of claim 52, wherein said X-linked inhibitor of apoptosis 
protein (XIAP) is from a human. 

55. The nucleic acid of claim 1, wherein said nucleic acid encodes a human 
inhibitor of apoptosis protein 1 (HIAP1). 

56. The nucleic acid of claim 55, wherein said human inhibitor of apoptosis 
protein 1 (HIAP1) is from a mouse. 

57. The nucleic acid of claim 55, wherein said human inhibitor of apoptosis
protein 1 (HIAP1) is from a human.

58. The nucleic acid of claim 1, wherein said nucleic acid encodes a human
inhibitor of apoptosis protein 2 (HIAP2).

59. The nucleic acid of claim 58, wherein said human inhibitor of apoptosis 
protein 2 (HIAP2) is from a mouse. 

60. The nucleic acid of claim 58, wherein said human inhibitor of apoptosis 
protein 2 (HIAP2) is from a human. 

61. The nucleic acid of claim 5, wherein said X-linked inhibitor of apoptosis
protein (xiap) gene is from a mouse. 

62. The nucleic acid of claim 5, wherein said X-linked inhibitor of apoptosis 
protein (xiap) gene is from a human. 

63. The nucleic acid of claim 6, wherein said human inhibitor of apoptosis protein 
2 (hiap2) gene is from a mouse. 

64. The nucleic acid of claim 6, wherein said human inhibitor of apoptosis protein
2 (hiap2) gene is from a human.

65. The nucleic acid of claim 7, wherein said human inhibitor of apoptosis protein
1 (hiap1) gene is from a mouse.

66. The nucleic acid of claim 7, wherein said human inhibitor of apoptosis protein
1 (hiap1) gene is from a human.

67. A substantially pure nucleic acid having the sequence of Fig. 1 (SEQ ID NO: 
3), or degenerate variants thereof, and encoding the amino acid sequence of Fig. 1 (SEQ 
ID NO: 4). 

68. A substantially pure nucleic acid having the sequence of Fig. 2 (SEQ ID NO: 
5), or degenerate variants thereof, and encoding the amino acid sequence of Fig. 2 (SEQ 
ID NO: 6). 

69. A substantially pure nucleic acid having the sequence of Fig. 3 (SEQ ID NO: 
7), or degenerate variants thereof, and encoding the amino acid sequence of Fig. 3 (SEQ 
ID NO: 8). 

70. A substantially pure nucleic acid having the sequence of Fig. 4 (SEQ ID NO:
9), or degenerate variants thereof, and encoding the amino acid sequence of Fig. 4 (SEQ
ID NO: 10).

71. The method of claim 30, wherein said mammalian inhibitor of apoptosis
protein (IAP) polypeptide is human inhibitor of apoptosis protein 1 (HIAP1). 

72. The method of claim 30, wherein said mammalian inhibitor of apoptosis 
protein (IAP) polypeptide is human inhibitor of apoptosis protein 2 (HIAP2). 

73. The method of claim 30, wherein said mammalian inhibitor of apoptosis 
protein (IAP) polypeptide is murine X-linked inhibitor of apoptosis protein (m-XIAP). 

74. The method of claim 30, wherein said mammalian inhibitor of apoptosis 
protein (IAP) polypeptide is human X-linked inhibitor of apoptosis protein (XIAP). 

75. A substantially pure nucleic acid that hybridizes to a probe of at least 40
nucleotides in length, said probe derived from the DNA sequence of Fig. 1 (SEQ ID NO:
3), wherein said nucleic acid hybridizes to said probe under low stringency conditions,
said conditions comprising washing with 2X SSC at 40°C, and wherein said nucleic acid
encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said polypeptide
comprising a ring zinc finger (RZF) domain and at least one baculovirus inhibitor of
apoptosis repeat (BIR) domain.



76. A substantially pure nucleic acid that hybridizes to a probe of at least 40 
nucleotides in length, said probe derived from the DNA sequence of Fig. 2 (SEQ ID NO: 
5), wherein said nucleic acid hybridizes to said probe under low stringency conditions, 
said conditions comprising washing with 2X SSC at 40°C, and wherein said nucleic acid 
encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said polypeptide 
comprising a ring zinc finger (RZF) domain and at least one baculovirus inhibitor of 
apoptosis repeat (BIR) domain. 

77. A substantially pure nucleic acid that hybridizes to a probe of at least 40 
nucleotides in length, said probe derived from the DNA sequence of Fig. 3 (SEQ ID NO: 
7), wherein said nucleic acid hybridizes to said probe under low stringency conditions, 
said conditions comprising washing with 2X SSC at 40°C, and wherein said nucleic acid 
encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said polypeptide 
comprising a ring zinc finger (RZF) domain and at least one baculovirus inhibitor of 
apoptosis repeat (BIR) domain. 

78. A substantially pure nucleic acid that hybridizes to a probe of at least 40
nucleotides in length, said probe derived from the DNA sequence of Fig. 4 (SEQ ID NO:
9), wherein said nucleic acid hybridizes to said probe under low stringency conditions,
said conditions comprising washing with 2X SSC at 40°C, and wherein said nucleic acid
encodes a mammalian inhibitor of apoptosis protein (IAP) polypeptide, said polypeptide
comprising a ring zinc finger (RZF) domain and at least one baculovirus inhibitor of
apoptosis repeat (BIR) domain. --

REMARKS 

In general, Applicants' presently claimed invention features substantially pure 
nucleic acids encoding mammalian IAP polypeptides and methods of using such nucleic 
acids to produce such mammalian IAP polypeptides. 

Support for the Amendments 

The specification and drawings have been amended to comply with the 
requirements of 37 C.F.R. § 1.821 through 1.825. The specification has also been 
amended to properly refer to each individual panel of a drawing. 

The specification and the claims have been amended to correct regrettable 
typographical errors. 

Applicants have added new claims 48 and 49 to cover substantially pure DNA 
encoding mammalian inhibitor of apoptosis protein (IAP) polypeptides that hybridize 
under low stringency conditions to SEQ ID NO: 39 and SEQ ID NO: 41, respectively. 
Support for these new claims may be found in the specification at page 48, lines 15-20. 

Applicants have added new claim 50 to cover a substantially pure DNA encoding a
baculovirus inhibitor of apoptosis repeat domain that comprises the sequence of SEQ ID
NO: 45, SEQ ID NO: 46, SEQ ID NO: 47, SEQ ID NO: 49, SEQ ID NO: 50, SEQ ID
NO: 51, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 57, SEQ ID 
NO: 58, SEQ ID NO: 59, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID 
NO: 65, SEQ ID NO: 66, or SEQ ID NO: 67. Support for this new claim can be found in 
the specification at page 19 in Table I (page 19, lines 12-20). The DNA of this claim 
finds use as, for example, a hybridization probe for screening libraries. 

Applicants have added new claim 51 to cover a substantially pure DNA encoding a 
ring zinc finger domain that comprises the sequence of SEQ ID NO: 48, SEQ ID NO: 52, 
SEQ ID NO: 56, SEQ ID NO: 60, SEQ ID NO: 64, or SEQ ID NO: 68. Support for this 
new claim can be found in the specification at page 19 in Table I (page 19, lines 12-20). 
The DNA of this claim finds use as, for example, a hybridization probe for screening 
libraries. 

Applicants have added new claim 52 to claim nucleic acid encoding an X-linked 
inhibitor of apoptosis protein (XIAP). New dependent claims 53 and 54 have been added 
to specifically claim nucleic acids encoding XIAP from a mouse and from a human, 
respectively. Support for these new claims may be found in the specification at page 21, 
lines 2-21, page 22, lines 8-32, and in Figs. 1 and 4. 

Applicants have added new claim 55 to claim nucleic acid encoding a human
inhibitor of apoptosis protein 1 (HIAP1). New dependent claims 56 and 57 have been
added to specifically claim nucleic acids encoding HIAP1 from a mouse and from a
human, respectively. Support for these new claims may be found in the specification at
page 21, line 22 through page 22, line 7, and in Figs. 2 and 5.

Applicants have added new claim 58 to claim nucleic acid encoding a human 
inhibitor of apoptosis protein 2 (HIAP2). New dependent claims 59 and 60 have been 
added to specifically claim nucleic acids encoding HIAP2 from a mouse and from a 
human, respectively. Support for these new claims may be found in the specification at 
page 21, line 22 through page 22, line 7, and in Figs. 3 and 6. 

Applicants have added new dependent claims 61 and 62 to specifically claim 
nucleic acids containing the X-linked inhibitor of apoptosis (xiap) gene, where the (xiap) 
gene is from a mouse or a human, respectively. Support for these new claims may be 
found in the specification at page 21, lines 2-21, page 22, lines 8-32, and in Figs. 1 and 4. 

Applicants have added new dependent claims 63 and 64 to specifically claim 
nucleic acids containing the human inhibitor of apoptosis 2 (hiap2) gene, where the 
(hiap2) gene is from a mouse or a human, respectively. Support for these new claims 
may be found in the specification at page 21, line 22 through page 22, line 7, and in Figs. 
3 and 6. 

Applicants have added new dependent claims 65 and 66 to specifically claim
nucleic acids containing the human inhibitor of apoptosis 1 (hiap1) gene, where the
(hiap1) gene is from a mouse or a human, respectively. Support for these new claims
may be found in the specification at page 21, line 22 through page 22, line 7, and in Figs.
2 and 5.

Applicants have added new claims 67, 68, 69, and 70 to specifically claim
substantially pure nucleic acids having the sequence of and encoding the amino acid 
sequence of Figs. 1, 2, 3, and 4, respectively. Support for these new claims may be found 
in the specification, for example, at page 21, line 2 through page 22, line 32, and in Figs. 
1-4. 

Applicants have added new dependent claims 71, 72, 73, and 74 to specifically 
claim methods for producing human inhibitor of apoptosis protein 1, human inhibitor of 
apoptosis protein 2, murine X-linked inhibitor of apoptosis protein, and human X-linked 
inhibitor of apoptosis protein, respectively. Support for these new claims may be found 
in the specification, for example, at page 5, lines 7-12; at page 21, line 2 through page 22, 
line 32; and in Figs. 1-4. 

Applicants have added new claims 75-78 to specifically claim substantially pure 
nucleic acids encoding mammalian inhibitor of apoptosis protein (IAP) polypeptides that 
hybridize under low stringency conditions to probes derived from the DNA sequences of 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID NO: 9, respectively. Support 
for these new claims may be found in the specification at page 48, lines 15-20, and in 
Figs. 1-4. No new matter is added by any of these amendments. 

Sequence Listing 

As required by 37 CFR 1.825(a), enclosed is an amended sequence listing
consisting of 42 sheets to be inserted at the end of the application. The amendments to
the sequence listing provide each sequence in the specification with a unique SEQ ID
NO, and contain no new matter. In particular, SEQ ID NOS: 69-92 have been added to 
include the sequences described in Table 2, found at page 20 of the specification. 

As required by 37 CFR 1.825(b), also enclosed is a diskette containing a copy of 
the sequence listing in computer readable form including all previously submitted data 
with the amendments incorporated therein. The contents of the computer readable form 
are the same as the contents of the paper sheets. 

If there are any charges or any credits, please apply them to Deposit Account No.
03-2095.

MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES, AND DETECTION METHODS

The invention relates to apoptosis.

Background of the Invention 

There are two general ways by which cells die. An
easily recognized pathway is necrosis, a process of cell
death usually resulting from severe and sudden injury. In
necrosis, changes in cellular homeostasis occur with loss of
membrane integrity. Dysregulation of osmotic pressure
results and, as a consequence, the cells swell and finally
rupture. The cellular contents are then spilled into the
surrounding tissue space and, usually, an inflammation
response ensues. A second form of cell death is apoptosis.
This cell "suicide" pathway or programmed cell death often
occurs so rapidly that in some biological systems the
apoptotic process is difficult to ascertain. Indeed, it has
been only in the past few years that the involvement of
apoptosis in a wide spectrum of biological processes has
become recognized. Apoptosis is a fundamental physiological
pathway of cell death, highly conserved throughout
evolution, and playing a major role in development, viral
pathogenesis, cancer, autoimmune diseases and
neurodegenerative disorders.

Inappropriate increases in apoptosis may cause or
contribute to a variety of diseases, including AIDS,
neurodegenerative diseases (e.g., Alzheimer's Disease,
Parkinson's Disease, Amyotrophic Lateral Sclerosis (ALS),
retinitis pigmentosa and other diseases of the retina),
myelodysplastic syndrome (e.g., aplastic anemia), toxin-induced
liver disease (e.g., alcoholism) and ischemic injury
(e.g., myocardial infarction, stroke, and reperfusion
injury). In addition, disruption of normally occurring
apoptosis has been implicated in the development of some
cancers (e.g., follicular lymphoma, p53 carcinomas, and
hormone dependent tumors), autoimmune disorders (e.g., lupus
erythematosus and multiple sclerosis) and viral infections
(e.g., herpes virus, poxvirus, and adenovirus infections).

Mature CD4+ T-lymphocytes in patients with HIV-1
have been observed to respond to stimulation with mitogens
or super-antigens by undergoing increased apoptosis. The
great majority of these cells are not infected and similar
inappropriate antigen-induced apoptosis could be very
important in the destruction of this vital part of the
immune system early in HIV infection.

Baculoviruses encode inhibitors of apoptosis
proteins (IAPs). These proteins inhibit the apoptosis which
otherwise occurs when insect cells are infected by the
virus. Baculovirus IAP proteins work in a manner which is
thought to be independent of other viral proteins. The
baculovirus IAP genes include sequences encoding a ring zinc
finger-like motif which is presumed to be involved in the
direct binding of DNA.

Summary of the Invention

In general, the invention features substantially
pure DNA (for example, genomic DNA, cDNA, or synthetic DNA)
encoding a mammalian IAP polypeptide as defined below. In
related aspects, the invention also features a vector, a
cell (e.g., a mammalian, yeast or bacterial cell), and a
transgenic animal or embryo thereof which includes such a
substantially pure DNA encoding an IAP polypeptide.

In preferred embodiments, an IAP gene is the xiap
(including human xiap and its murine homolog, m-xiap), hiap1
(including human hiap1 and m-hiap1), or the hiap2 gene
(including human hiap2 and m-hiap2). In most preferred
embodiments the IAP gene is a human IAP gene. In other
various preferred embodiments, the cell is a transformed
cell. In related aspects, the invention features a
transgenic animal containing a transgene which encodes an
IAP polypeptide that is expressed in or delivered to tissue
normally susceptible to apoptosis.

In yet another aspect, the invention features DNA 
encoding fragments of IAP polypeptides including the BIR 
domains and the RZF domains provided herein. 

In specific embodiments, the invention features DNA 
sequences substantially identical to the DNA sequences shown 
in Figs. 1-6. 

In another aspect, the invention also features RNA 
which is encoded by the DNA described herein. Preferably, 
the RNA is mRNA. In another embodiment the RNA is antisense 
RNA. 

In another aspect, the invention features a 
substantially pure polypeptide having a sequence 
substantially identical to one of the IAP amino acid 
sequences shown in Figures 1-6. 

In a second aspect, the invention features a 
substantially pure DNA which includes a promoter capable of 
expressing the IAP gene in a cell susceptible to apoptosis. 
In preferred embodiments, the IAP gene is xiap (including
the human or murine xiap), hiap1 (preferably the human or
murine hiap1), or hiap2 (preferably the human or murine
hiap2). hiap2 may be the full length gene, as shown in Fig.
3, or the truncated variant having the sequence boxed in
Fig. 3 deleted.



In preferred embodiments, the promoter is the
promoter native to an IAP gene. Additionally,
transcriptional and translational regulatory regions are
preferably native to an IAP gene.

In another aspect, the invention provides transgenic
cell lines and transgenic animals. The transgenic cells of
the invention are preferably cells which are susceptible to
apoptosis. In preferred embodiments, the transgenic cell is
a fibroblast, neuronal cell, a lymphocyte cell, or an insect
cell. Most preferably, the neuron is a motor neuron and the
lymphocyte is a CD4+ T-cell.

In another aspect, the invention features a method
of inhibiting apoptosis which involves producing a
transgenic cell having a transgene encoding an IAP
polypeptide wherein the transgene is integrated into the
genome of the cell and is positioned for expression in the
cell and wherein the IAP transgene is expressed in the cell
at a level sufficient to inhibit apoptosis.

In a related aspect, the invention features a
transgenic animal, preferably a mammal, more preferably a
rodent, and most preferably a mouse, having either increased
copies of IAP genes inserted into the genome or a knockout
of an IAP gene in the genome. The transgenic animals may
express an increased amount of IAP polypeptide or may
express a decreased amount of an IAP polypeptide,
respectively. In related embodiments, the invention
provides a method of utilizing the IAP nucleic acid to
engineer a knockout mutation in an IAP gene and a method of
making an animal with increased expression by insertion of
an IAP gene into the genome.

In another aspect, the invention features a method
of detecting an IAP in a cell involving: (a) contacting the
IAP gene or a portion thereof greater than 9 nucleic acids,
preferably greater than 18 nucleic acids, in length with a
preparation of genomic DNA from the cell under hybridization
conditions providing detection of DNA sequences having about
50% or greater nucleotide sequence identity to the amino
acid encoding DNA sequences of hiap1, hiap2, or xiap IAP
polypeptides.

In another aspect, the invention features a method
of producing an IAP polypeptide which involves: (a)
providing a cell transformed with DNA encoding an IAP
polypeptide positioned for expression in the cell; (b)
culturing the cell under conditions for expressing the DNA;
and (c) isolating the IAP polypeptide. In preferred
embodiments the IAP polypeptide is expressed by DNA which
has a constitutive or inducible promoter. In one
embodiment, the promoter is a heterologous promoter.

In another aspect, the invention features
substantially pure mammalian IAP polypeptide. Preferably,
the polypeptide includes a greater than 50 amino acid
sequence substantially identical to a greater than 50 amino
acid sequence shown in any one of Figs. 1-4. Most
preferably, the polypeptide is the human or murine XIAP,
HIAP1, or HIAP2 polypeptide. Fragments including BIR
domains and RZF domains provided herein are also a part of
the invention.

In another aspect, the invention features a
recombinant mammalian polypeptide capable of modulating
apoptosis wherein the polypeptide includes at least a ring
zinc finger domain and a BIR domain as defined herein. In
preferred embodiments, the invention features a
substantially pure polypeptide and an oligonucleotide
encoding said polypeptide, the polypeptide including a ring
zinc finger (RZF) having the sequence:
Glu Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa2 Xaa1 Xaa1
Xaa1 Cys Lys Xaa3 Cys Met Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa3 Xaa1
Phe Xaa1 Pro Cys Gly His Xaa1 Xaa1 Xaa1 Cys Xaa1 Xaa1 Cys
Ala Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Cys Pro Xaa1 Cys, wherein Xaa1
is any amino acid, Xaa2 is Glu or Asp, and Xaa3 is Val or Ile
(SEQ ID NO:1); and at least one BIR domain having the
sequence: Xaa1 Xaa1 Xaa1 Arg Leu Xaa1 Thr Phe Xaa1 Xaa1 Trp
Pro Xaa2 Xaa1 Xaa1 Xaa2 Xaa2 Xaa1 Xaa1 Xaa1 Xaa1 Leu Ala
Xaa1 Ala Gly Phe Tyr Tyr Xaa1 Gly Xaa1 Xaa1 Asp Xaa1 Val
Xaa1 Cys Phe Xaa1 Cys Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Trp Xaa1
Xaa1 Xaa1 Asp Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 His Xaa1 Xaa1 Xaa1
Xaa1 Pro Xaa1 Cys Xaa1 Phe Val, wherein Xaa1 may be any
amino acid and Xaa2 may be any amino acid or may be absent
(SEQ ID NO:2).

In various preferred embodiments the protein has at
least two or, more preferably, at least three BIR domains,
the RZF domain has one of the IAP sequences shown in Fig. 6,
and the BIR domains are comprised of BIR domains shown in
Fig. 5. In other preferred embodiments the BIR domains are
at the amino terminal end of the protein relative to the RZF
domain, which is at or near the carboxy terminus of the
polypeptide.

In another aspect, the invention features an IAP
gene isolated according to the method involving: (a)
providing a sample of DNA; (b) providing a pair of
oligonucleotides having sequence homology to a conserved
region of an IAP disease-resistance gene; (c) combining the
pair of oligonucleotides with the cell DNA sample under
conditions suitable for polymerase chain reaction-mediated
DNA amplification; and (d) isolating the amplified IAP gene
or fragment thereof.

In preferred embodiments, the amplification is
carried out using a reverse-transcription polymerase chain
reaction, for example, the RACE method.

In another aspect, the invention features an IAP
gene isolated according to the method involving: (a)
providing a preparation of DNA; (b) providing a
detectably-labelled DNA sequence having homology to a
conserved region of an IAP gene; (c) contacting the
preparation of DNA with the detectably-labelled DNA sequence
under hybridization conditions providing detection of genes
having 50% or greater nucleotide sequence identity; and (d)
identifying an IAP gene by its association with the
detectable label.

In another aspect, the invention features an IAP
gene isolated according to the method involving: (a)
providing a cell sample; (b) introducing by transformation
into the cell sample a candidate IAP gene; (c) expressing
the candidate IAP gene within the cell sample; and (d)
determining whether the cell sample exhibits an altered
apoptotic response, whereby a response identifies an IAP
gene.

In another aspect, the invention features a method
of identifying an IAP gene in a cell, involving: (a)
providing a preparation of cellular DNA (for example, from
the human genome or a cDNA library (such as a cDNA library
isolated from a cell type which undergoes apoptosis)); (b)
providing a detectably-labelled DNA sequence (for example,
prepared by the methods of the invention) having homology to
a conserved region of an IAP gene; (c) contacting the
preparation of cellular DNA with the detectably-labelled DNA
sequence under hybridization conditions providing detection
of genes having 50% or greater nucleotide sequence identity;
and (d) identifying an IAP gene by its association with the
detectable label.

In another aspect, the invention features a method
of isolating an IAP gene from a recombinant library,
involving: (a) providing a recombinant library; (b)
contacting the library with a detectably-labelled gene
fragment produced according to the PCR method of the
invention under hybridization conditions providing detection
of genes having 50% or greater nucleotide sequence identity;
and (c) isolating an IAP gene by its association with the
detectable label.

In another aspect, the invention features a method
of identifying an IAP gene involving: (a) providing a cell
tissue sample; (b) introducing by transformation into the
cell sample a candidate IAP gene; (c) expressing the
candidate IAP gene within the cell sample; and (d)
determining whether the cell sample exhibits inhibition of
apoptosis, whereby a change in (i.e., modulation of)
apoptosis identifies an IAP gene.

Preferably, the cell sample is a cell type which may
be assayed for apoptosis (e.g., lymphocytes, T-cells and
B-cells, neuronal cells, baculovirus-infected insect cells
and fibroblast cells); the candidate IAP gene is obtained
from a cDNA expression library; and the apoptosis response
is the inhibition of apoptosis.

In another aspect, the invention features a method
of inhibiting apoptosis in a mammal wherein the method
includes: (a) providing DNA encoding at least one IAP
polypeptide to a cell which is susceptible to apoptosis;
wherein the DNA is integrated into the genome of the cell
and is positioned for expression in the cell; and the IAP
gene is under the control of regulatory sequences suitable
for controlled expression of the gene(s); wherein the IAP
transgene is expressed at a level sufficient to inhibit
apoptosis relative to a cell lacking the IAP transgene. It
will be appreciated that IAP polypeptides also may be
administered directly to inhibit any undesirable apoptosis.

In a related aspect, the invention features a method 
of inhibiting apoptosis wherein the method involves: (a) 
producing a cell having integrated in the genome a transgene 
containing the IAP gene under the control of a promoter 
providing constitutive expression of the IAP gene. 

In yet another related aspect, the invention 
features a method of inhibiting apoptosis wherein the method 
involves: (a) producing a cell having integrated in the 
genome a transgene containing the IAP gene under the control 
of a promoter providing controllable expression of the IAP 
gene; and (b) regulating the environment of the cell so that 
the IAP transgene is controllably expressed in the cell. In 
preferred embodiments, the IAP gene is expressed using a 
tissue-specific or cell type-specific promoter, or by a 
promoter that is activated by the introduction of an 
external signal or agent, such as a chemical signal or 
agent. In preferred embodiments the cell is a lymphocyte or
B-cell, a neuronal cell, or a fibroblast. In other
embodiments the cell is a cell in an HIV-infected human, or
a mammal with a neurodegenerative disease, ischemia,
toxin-induced liver disease, or a myelodysplastic syndrome.

In a related aspect, the invention provides a method 
of inhibiting apoptosis in a mammal by providing an 
apoptosis-inhibiting amount of IAP polypeptide. 

In another aspect, the invention features a purified 
antibody which binds specifically to an IAP family protein. 
Such an antibody may be used in any standard immunodetection 
method for the identification of an IAP polypeptide. 
Preferably, the antibody binds specifically to xiap, hiap1
or hiap2. In various embodiments the antibody may react
with other IAP polypeptides or may be specific for one or a
few IAP polypeptides. The antibody may be a monoclonal or
polyclonal antibody. Preferably, the antibody reacts
specifically with only one of the IAP polypeptides, for
example, reacts with murine and human xiap, but not with
hiap1 or hiap2 from mammalian species.

In another aspect, the invention features a method
of identifying a compound which modulates apoptosis. The
method includes (a) providing a cell expressing an IAP
polypeptide; and (b) contacting the cell with a candidate
compound, and monitoring the expression of an IAP gene. An
alteration in the level of expression of the IAP gene
indicates the presence of a compound which modulates
apoptosis. The compound may be an inhibitor or an enhancer
of apoptosis. In various preferred embodiments, the cell is
a fibroblast, a neuronal cell, a lymphocyte (T-cell or
B-cell), or an insect cell; the polypeptide expression being
monitored is XIAP, HIAP1, or HIAP2 (e.g., human or murine).

In a related aspect, the invention features methods
of detecting compounds which modulate apoptosis using the
interaction trap technology and IAP polypeptides or
fragments thereof as a component of the bait. In preferred
embodiments, the compound being tested as a modulator of
apoptosis is also a polypeptide.

In another aspect, the invention features a method
for diagnosing a cell proliferation disease, or an increased
likelihood of such a disease, using an IAP nucleic acid probe
or antibody. Preferably, the disease is a cancer. Most
preferably, the disease is selected from the group
consisting of promyelocytic leukemia, a HeLa-type carcinoma,
chronic myelogenous leukemia (preferably using xiap or hiap2
related probes), lymphoblastic leukemia (preferably using a
xiap related probe), Burkitt's lymphoma (preferably using an
hiap1 related probe), colorectal adenocarcinoma, lung
carcinoma, and melanoma (preferably using a xiap probe).
Preferably, a diagnosis is indicated by a 2-fold increase in
expression or activity, more preferably, at least a 10-fold
increase in expression or activity.

By "IAP gene" is meant a gene encoding a polypeptide 
having at least one BIR domain and a ring zinc finger domain 
which is capable of modulating (inhibiting or enhancing) 
apoptosis in a cell or tissue when provided by other 
intracellular or extracellular delivery methods. In
preferred embodiments the IAP gene is a gene having about 
50% or greater nucleotide sequence identity to at least one 
of the IAP amino acid encoding sequences of Figs. 1-4 or 
portions thereof. Preferably, the region of sequence over 
which identity is measured is a region encoding at least one 
BIR domain and a ring zinc finger domain. Mammalian IAP 
genes include nucleotide sequences isolated from any 
mammalian source. Preferably, the mammal is a human. 

By an "IAP gene" is also meant any member of the 
family of apoptosis inhibitory genes characterized by their 
ability to modulate apoptosis and having at least 20%, 
preferably 30%, and most preferably 50% amino acid sequence
identity to at least one of the conserved regions of one of
the IAP members described herein (i.e., either the BIR or
ring zinc finger domains from the human or murine xiap,
hiap1 and hiap2). Representative members of the IAP gene
family include, without limitation, the human and murine
xiap, hiap1, and hiap2 genes. By "IAP protein" is meant a
polypeptide encoded by an IAP gene. 

By "BIR domain" is meant a domain having the amino 
acid sequence of the consensus sequence: Xaa1 Xaa1 Xaa1 Arg
Leu Xaa1 Thr Phe Xaa1 Xaa1 Trp Pro Xaa2 Xaa1 Xaa1 Xaa2 Xaa2
Xaa1 Xaa1 Xaa1 Xaa1 Leu Ala Xaa1 Ala Gly Phe Tyr Tyr Xaa1
Gly Xaa1 Xaa1 Asp Xaa1 Val Xaa1 Cys Phe Xaa1 Cys Xaa1 Xaa1
Xaa1 Xaa1 Xaa1 Xaa1 Trp Xaa1 Xaa1 Xaa1 Asp Xaa1 Xaa1 Xaa1
Xaa1 Xaa1 His Xaa1 Xaa1 Xaa1 Xaa1 Pro Xaa1 Cys Xaa1 Phe Val,
wherein Xaa1 is any amino acid and Xaa2 is any amino acid or
is absent (SEQ ID NO:2). Preferably, the sequence is
substantially identical to one of the BIR domain sequences
provided for xiap, hiap1, or hiap2 herein.

By "ring zinc finger" or "RZF" is meant a domain 
having the amino acid sequence of the consensus sequence: 
Glu Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa2 Xaa1 Xaa1 Xaa1 Cys
Lys Xaa3 Cys Met Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa3 Xaa1 Phe Xaa1
Pro Cys Gly His Xaa1 Xaa1 Xaa1 Cys Xaa1 Xaa1 Cys Ala Xaa1
Xaa1 Xaa1 Xaa1 Xaa1 Cys Pro Xaa1 Cys, wherein Xaa1 is any
amino acid, Xaa2 is Glu or Asp, and Xaa3 is Val or Ile (SEQ
ID NO:1). Preferably, the sequence is substantially
identical to the RZF domains provided herein for the human
or murine xiap, hiap1, or hiap2.
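
The BIR and RZF consensus definitions above can be read as sequence patterns. The following Python sketch is illustrative only and is not part of the original disclosure. It translates the three-letter consensus strings into regular expressions over one-letter amino acid codes. It assumes that Xaa1 corresponds to any uppercase one-letter code, that each Xaa2 in the BIR consensus is an optional residue, and that Xaa2 and Xaa3 in the RZF consensus are restricted to Glu/Asp and Val/Ile respectively. The names consensus_to_regex, BIR_PATTERN and RZF_PATTERN are hypothetical.

```python
import re

# One-letter codes for the fixed residues used in the two consensus sequences.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asp": "D", "Cys": "C", "Glu": "E",
    "Gly": "G", "His": "H", "Leu": "L", "Lys": "K", "Met": "M",
    "Phe": "F", "Pro": "P", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

ANY = "[A-Z]"  # assumption: any uppercase letter stands for "any amino acid"

BIR_CONSENSUS = """
Xaa1 Xaa1 Xaa1 Arg Leu Xaa1 Thr Phe Xaa1 Xaa1 Trp Pro Xaa2 Xaa1 Xaa1 Xaa2 Xaa2
Xaa1 Xaa1 Xaa1 Xaa1 Leu Ala Xaa1 Ala Gly Phe Tyr Tyr Xaa1 Gly Xaa1 Xaa1 Asp
Xaa1 Val Xaa1 Cys Phe Xaa1 Cys Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Trp Xaa1 Xaa1
Xaa1 Asp Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 His Xaa1 Xaa1 Xaa1 Xaa1 Pro Xaa1 Cys Xaa1
Phe Val
"""

RZF_CONSENSUS = """
Glu Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa2 Xaa1 Xaa1 Xaa1 Cys Lys Xaa3 Cys Met
Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Xaa3 Xaa1 Phe Xaa1 Pro Cys Gly His Xaa1 Xaa1 Xaa1
Cys Xaa1 Xaa1 Cys Ala Xaa1 Xaa1 Xaa1 Xaa1 Xaa1 Cys Pro Xaa1 Cys
"""

# In the BIR consensus Xaa2 may be absent, so it becomes an optional position.
BIR_PLACEHOLDERS = {"Xaa1": ANY, "Xaa2": ANY + "?"}
# In the RZF consensus Xaa2 is Glu or Asp and Xaa3 is Val or Ile.
RZF_PLACEHOLDERS = {"Xaa1": ANY, "Xaa2": "[ED]", "Xaa3": "[VI]"}


def consensus_to_regex(consensus, placeholders):
    """Compile a three-letter consensus string into a one-letter regex."""
    parts = []
    for token in consensus.split():
        if token in placeholders:
            parts.append(placeholders[token])
        else:
            parts.append(THREE_TO_ONE[token])
    return re.compile("".join(parts))


BIR_PATTERN = consensus_to_regex(BIR_CONSENSUS, BIR_PLACEHOLDERS)
RZF_PATTERN = consensus_to_regex(RZF_CONSENSUS, RZF_PLACEHOLDERS)
```

Under these assumptions, a call such as RZF_PATTERN.search(protein_sequence) would locate a candidate RZF domain within a one-letter protein sequence. A strict pattern of this kind reports only exact matches to the consensus and does not measure the degree of identity discussed elsewhere herein.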

By "modulating apoptosis" or "altering apoptosis" is 
meant increasing or decreasing the number of cells which 
undergo apoptosis in a given cell population. Preferably, 
the cell population is selected from a group including
T-cells, neuronal cells, fibroblasts, or any other cell line
known to undergo apoptosis in a laboratory setting (e.g.,
the baculovirus-infected insect cells). It will be
appreciated that the degree of modulation provided by an IAP 
or modulating compound in a given assay will vary, but that 
one skilled in the art can determine the statistically 
significant change in the level of apoptosis which 
identifies an IAP or a compound which modulates an IAP. 

By "inhibiting apoptosis" is meant any decrease in 
the number of cells which undergo apoptosis relative to an 
untreated control. Preferably, the decrease is at least 
25%, more preferably the decrease is 50%, and most 
preferably the decrease is at least one-fold. 
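
As a worked illustration of this definition, the percentage decrease may be computed from counts of apoptotic cells in treated and untreated populations of equal size. The sketch below is hypothetical: the function name and the example counts are invented for illustration and are not data from this application.

```python
def percent_decrease_in_apoptosis(control_apoptotic, treated_apoptotic):
    """Decrease in apoptotic cells relative to the untreated control, in percent."""
    if control_apoptotic <= 0:
        raise ValueError("control count must be positive")
    return 100.0 * (control_apoptotic - treated_apoptotic) / control_apoptotic


# Hypothetical counts: 400 apoptotic cells in the control, 250 after treatment.
decrease = percent_decrease_in_apoptosis(400, 250)

# The lowest preferred threshold stated above is a decrease of at least 25%.
meets_preferred_threshold = decrease >= 25.0
```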




By "polypeptide" is meant any chain of amino acids, 
regardless of length or post-translational modification 
(e.g., glycosylation or phosphorylation). 

By "substantially identical" is meant a polypeptide 
or nucleic acid exhibiting at least 50%, preferably 85%, 
more preferably 90%, and most preferably 95% homology to a 
reference amino acid or nucleic acid sequence. For 
polypeptides, the length of comparison sequences will 
generally be at least 16 amino acids, preferably at least 20 
amino acids, more preferably at least 25 amino acids, and 
most preferably 35 amino acids. For nucleic acids, the 
length of comparison sequences will generally be at least 50 
nucleotides, preferably at least 60 nucleotides, more 
preferably at least 75 nucleotides, and most preferably 110 
nucleotides. 

Sequence identity is typically measured using 
sequence analysis software (e.g., Sequence Analysis Software 
Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, WI 53705). Such software matches similar sequences
by assigning degrees of homology to various substitutions,
deletions, and other modifications. Conservative
substitutions typically include substitutions within the
following groups: glycine, alanine, valine, isoleucine,
leucine; aspartic acid, glutamic acid, asparagine,
glutamine; serine, threonine; lysine, arginine; and
phenylalanine, tyrosine.
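
The conservative substitution groups and the identity thresholds above can be sketched in a few lines of Python. This is a simplified illustration, not the algorithm of the software package named above: it assumes the two sequences have already been aligned to equal length (with "-" marking gaps), it scores plain identity rather than weighted homology, and its function names are hypothetical. The defaults of 16 residues and 50% are the minimum values given above for polypeptides.

```python
# Conservative substitution groups as listed in the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    set("GAVIL"),  # glycine, alanine, valine, isoleucine, leucine
    set("DENQ"),   # aspartic acid, glutamic acid, asparagine, glutamine
    set("ST"),     # serine, threonine
    set("KR"),     # lysine, arginine
    set("FY"),     # phenylalanine, tyrosine
]


def is_conservative(a, b):
    """True if residues a and b differ but belong to the same group."""
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def percent_identity(seq_a, seq_b):
    """Percent of aligned positions carrying the same residue (gaps never match)."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be pre-aligned, equal-length, non-empty")
    same = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    return 100.0 * same / len(seq_a)


def substantially_identical(seq_a, seq_b, min_length=16, threshold=50.0):
    """Apply the minimum comparison length and the minimum identity threshold."""
    return len(seq_a) >= min_length and percent_identity(seq_a, seq_b) >= threshold
```

For nucleic acids, the same check could be called with min_length=50, the minimum comparison length given above for nucleotide sequences.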

By a "substantially pure polypeptide" is meant an 
IAP polypeptide which has been separated from components 
which naturally accompany it. Typically, the polypeptide is 
substantially pure when it is at least 60%, by weight, free 
from the proteins and naturally-occurring organic molecules 
with which it is naturally associated. Preferably, the
preparation is at least 75%, more preferably at least 90%,
and most preferably at least 99%, by weight, IAP
polypeptide. A substantially pure IAP polypeptide may be
obtained, for example, by extraction from a natural source
(e.g., a fibroblast, neuronal cell, or lymphocyte); by
expression of a recombinant nucleic acid encoding an IAP
polypeptide; or by chemically synthesizing the protein.
Purity can be measured by any appropriate method, e.g.,
column chromatography, polyacrylamide gel
electrophoresis, or HPLC analysis.

A protein is substantially free of naturally 
associated components when it is separated from those 
contaminants which accompany it in its natural state. Thus, 
a protein which is chemically synthesized or produced in a 
cellular system different from the cell from which it 
naturally originates will be substantially free from its 
naturally associated components. Accordingly, substantially 
pure polypeptides include those derived from eukaryotic 
organisms but synthesized in E. coli or other prokaryotes.

By "substantially pure DNA" is meant DNA that is 
free of the genes which, in the naturally-occurring genome 
of the organism from which the DNA of the invention is 
derived, flank the gene. The term therefore includes, for 
example, a recombinant DNA which is incorporated into a 
vector; into an autonomously replicating plasmid or virus; 
or into the genomic DNA of a prokaryote or eukaryote; or 
which exists as a separate molecule (e.g., a cDNA or a 
genomic or cDNA fragment produced by PCR or restriction 
endonuclease digestion) independent of other sequences. It 
also includes a recombinant DNA which is part of a hybrid 
gene encoding additional polypeptide sequence. 

By "transformed cell" is meant a cell into which (or 
into an ancestor of which) has been introduced, by means of
recombinant DNA techniques, a DNA molecule encoding (as used
herein) an IAP polypeptide.

By "transgene" is meant any piece of DNA which is 
inserted by artifice into a cell, and becomes part of the
genome of the organism which develops from that cell. Such
a transgene may include a gene which is partly or entirely
heterologous (i.e., foreign) to the transgenic organism, or
may represent a gene homologous to an endogenous gene of the
organism.

By "transgenic" is meant any cell which includes a
DNA sequence which is inserted by artifice into a cell and
becomes part of the genome of the organism which develops
from that cell. As used herein, the transgenic organisms are
generally transgenic mammals (e.g., rodents such as rats
or mice) and the DNA (transgene) is inserted by artifice
into the nuclear genome.

By "transformation" is meant any method for 
introducing foreign molecules into a cell. Lipofection,
calcium phosphate precipitation, retroviral delivery,
electroporation and biolistic transformation are just a few
of the teachings which may be used. For example, biolistic
transformation is a method for introducing foreign molecules
into a cell using velocity-driven microprojectiles such as
tungsten or gold particles. Such velocity-driven methods
originate from pressure bursts which include, but are not
limited to, helium-driven, air-driven, and gunpowder-driven
techniques. Biolistic transformation may be applied to the
transformation or transfection of a wide variety of cell
types and intact tissues including, without limitation,
intracellular organelles (e.g., mitochondria and
chloroplasts), bacteria, yeast, fungi, algae, animal tissue,
and cultured cells.

By "positioned for expression" is meant that the DNA
molecule is positioned adjacent to a DNA sequence which
directs transcription and translation of the sequence (i.e.,
facilitates the production of, e.g., an IAP polypeptide, a
recombinant protein or an RNA molecule).

By "reporter gene" is meant a gene whose expression 
may be assayed; such genes include, without limitation,
β-glucuronidase (GUS), luciferase, chloramphenicol
transacetylase (CAT), and β-galactosidase.

By "promoter" is meant minimal sequence sufficient 
to direct transcription. Also included in the invention are 
those promoter elements which are sufficient to render 
promoter-dependent gene expression controllable for cell- 
type specific, tissue-specific or inducible by external 
signals or agents; such elements may be located in the 5' or 
3' regions of the native gene. 

By "operably linked" is meant that a gene and a 
regulatory sequence(s) are connected in such a way as to
permit gene expression when the appropriate molecules (e.g.,
transcriptional activator proteins) are bound to the
regulatory sequence(s).

By "conserved region" is meant any stretch of six or 
more contiguous amino acids exhibiting at least 30%,
preferably 50%, and most preferably 70% amino acid sequence
identity between two or more of the IAP family members
(e.g., between human HIAP1, HIAP2, and XIAP). Examples of
preferred conserved regions are shown (as boxed or
designated sequences) in Figures 5-7 and Tables 1 and 2, and
include, without limitation, BIR domains and ring zinc
finger domains.
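
The definition of a conserved region can be illustrated by a sliding-window scan over two aligned sequences. The sketch below is a simplified assumption rather than a method taken from this application: it takes two pre-aligned one-letter sequences of equal length, examines every window of six contiguous positions, and reports the windows whose identity meets the chosen threshold. The function name and the two toy sequences are hypothetical.

```python
def conserved_windows(seq_a, seq_b, window=6, min_identity=30.0):
    """Return (start, identity) for each aligned window meeting the threshold."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    hits = []
    for start in range(len(seq_a) - window + 1):
        pairs = zip(seq_a[start:start + window], seq_b[start:start + window])
        matches = sum(1 for a, b in pairs if a == b and a != "-")
        identity = 100.0 * matches / window
        if identity >= min_identity:
            hits.append((start, identity))
    return hits


# Toy aligned sequences, invented for illustration only.
EXAMPLE_A = "GFYYTGAAAA"
EXAMPLE_B = "GFYYVGKKKK"

# Window start positions (0-based) at the 30% and 70% thresholds named above.
starts_at_30 = [start for start, _ in conserved_windows(EXAMPLE_A, EXAMPLE_B)]
starts_at_70 = [start for start, _ in conserved_windows(EXAMPLE_A, EXAMPLE_B, min_identity=70.0)]
```

Overlapping windows that pass the threshold could then be merged into a single region of more than six residues.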

By "detectably-labelled" is meant any means for 
marking and identifying the presence of a molecule, e.g., an 
oligonucleotide probe or primer, a gene or fragment thereof,
or a cDNA molecule. Methods for detectably-labelling a
molecule are well known in the art and include, without
limitation, radioactive labelling (e.g., with an isotope
such as 32P or 35S) and nonradioactive labelling (e.g.,
chemiluminescent labelling, e.g., fluorescein labelling).

By "purified antibody" is meant antibody which is at 
least 60%, by weight, free from proteins and naturally- 
occurring organic molecules with which it is naturally 
associated. Preferably, the preparation is at least 75%,
more preferably 90%, and most preferably at least 99%, by
weight, antibody, e.g., an IAP specific antibody. A
purified antibody may be obtained, for example, by affinity 
chromatography using recombinantly-produced protein or 
conserved motif peptides and standard techniques. 

By "specifically binds" is meant an antibody which
recognizes and binds a protein but which does not
substantially recognize and bind other molecules in a
sample, e.g., a biological sample, which naturally includes
protein.

Other features and advantages of the invention will
be apparent from the following description of the preferred
embodiments thereof, and from the claims.



Detailed Description
The drawings will first be described.
Drawings

Fig. 1 is the human xiap cDNA sequence and the XIAP
polypeptide sequence (SEQ ID NOS:3, 4).

Fig. 2 is the human hiap1 cDNA sequence and the
HIAP1 polypeptide sequence (SEQ ID NOS:5, 6).

Fig. 3 is the human hiap2 cDNA sequence and the
HIAP2 polypeptide sequence (SEQ ID NOS:7, 8). The sequence
absent in the hiap2-G variant is boxed.

Fig. 4 is the murine xiap cDNA sequence and encoded
murine XIAP polypeptide sequence (SEQ ID NOS:9, 10).

Fig. 5 is the murine hiap1 cDNA sequence and the
encoded murine HIAP1 polypeptide sequence (SEQ ID NOS:39,
40).

Fig. 6 is the murine hiap2 cDNA sequence and the
encoded murine HIAP2 polypeptide sequence (SEQ ID NOS:41,
42).

Fig. 7 shows the alignment of the BIR domains of IAP
proteins (SEQ ID NOS:11 and 14-31).

Fig. 8 is the alignment of human IAP polypeptides
with diap, cp-iap, and the consensus sequence (SEQ ID NOS:4,
6, 8, 10, 12, and 13).

Fig. 9 shows the alignment of the Ring Zinc Finger
domains of IAP proteins (SEQ ID NOS:32-38).

Fig. 10 is a Northern blot showing human hiap1 and
hiap2 mRNA expression in human tissues.

Fig. 11 is a Northern blot showing human hiap2 mRNA
expression in human tissues.

Fig. 12 is a Northern blot showing human xiap mRNA
expression in human tissues.

Figs. 13A and 13B are agarose gels showing apoptotic
DNA ladders and RT-PCR products using hiap1 and hiap2
specific probes in HIV-infected T cells.

Figs. 14A - 14D are graphs showing apoptosis
suppression by XIAP, HIAP1, HIAP2, bcl-2m, smn and 6-myc.

I. IAP Polypeptides and Genes Encoding IAP Polypeptides

We have discovered a new class of mammalian proteins
which modulate apoptosis (IAPs) and the genes which encode
these proteins. The IAP proteins are characterized by the
presence of a ring zinc finger (RZF) domain (Fig. 9) and at
least one BIR domain as defined by the boxed consensus
sequences in Figs. 7 and 8 and by the sequence domains
provided in Tables 1 and 2. As examples of the IAP proteins
we provide the cDNA sequences and amino acid sequences for
these new human and murine apoptosis inhibitors, HIAP1,
HIAP2, and XIAP. Additional members of the mammalian IAP
family (including homologs from other species and mutant
sequences) may be isolated using standard cloning techniques
and the conserved amino acid sequences, primers and probes
provided herein and known in the art.

This application is related to U.S. Serial Number
08/511,485, filed August 4, 1995. U.S. S.N. 08/511,485 is
hereby incorporated by reference.



TABLE 1

NUCLEOTIDE POSITION OF CONSERVED DOMAINS*

           BIR-1        BIR-2        BIR-3          Ring Zinc Finger
h-xiap     109 - 312    520 - 723    826 - 1023     1348 - 1485
m-xiap     202 - 405    613 - 816    916 - 1113     1438 - 1575
h-hiap1    273 - 476    693 - 893    951 - 1154     1824 - 1961
m-hiap1    251 - 453    670 - 870    928 - 1131     1795 - 1932
h-hiap2    373 - 576    787 - 987    1042 - 1245    1915 - 2052
m-hiap2    215 - 418    608 - 808    863 - 1066     1763 - 1876



*Positions indicated correspond to those shown in Figs. 1-4.

TABLE 2

AMINO ACID POSITION OF CONSERVED DOMAINS*

           BIR-1       BIR-2        BIR-3        Ring Zinc Finger
h-Xiap     26 - 93     163 - 230    265 - 330    439 - 484
m-Xiap     26 - 93     163 - 230    264 - 329    438 - 483
h-Hiap1    29 - 96     169 - 235    255 - 322    546 - 591
m-Hiap1    29 - 96     169 - 235    255 - 322    544 - 589
h-Hiap2    46 - 113    184 - 250    269 - 336    560 - 605
m-Hiap2    25 - 92     156 - 222    241 - 308    541 - 578

*Positions indicated correspond to those shown in Figs. 1-4.
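
The nucleotide positions of Table 1 and the amino acid positions of Table 2 are related by codon arithmetic, each residue being encoded by three nucleotides. The Python sketch below illustrates this for the h-xiap row. The coding-sequence start used here (nucleotide 34) is not stated in the text; it is an assumption obtained by back-calculating from the BIR-1 entries (residue 26 beginning at nucleotide 109), and the function name is hypothetical.

```python
def aa_range_to_nt_range(aa_start, aa_end, cds_start):
    """Map a 1-based amino acid range to the 1-based nucleotide range encoding it."""
    nt_start = cds_start + 3 * (aa_start - 1)
    nt_end = cds_start + 3 * aa_end - 1
    return nt_start, nt_end


# Assumption: first nucleotide of the initiation codon, inferred from the
# h-xiap BIR-1 entries of Tables 1 and 2 rather than stated in the text.
H_XIAP_CDS_START = 34

# Amino acid positions of the h-Xiap domains, copied from Table 2.
H_XIAP_AA = {
    "BIR-1": (26, 93),
    "BIR-2": (163, 230),
    "BIR-3": (265, 330),
    "RZF": (439, 484),
}

# Nucleotide positions predicted from Table 2, for comparison with Table 1.
H_XIAP_NT = {
    name: aa_range_to_nt_range(start, end, H_XIAP_CDS_START)
    for name, (start, end) in H_XIAP_AA.items()
}
```

With this assumed start, all four predicted ranges agree with the h-xiap row of Table 1, which gives a simple consistency check between the two tables.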



Recognition of this mammalian IAP family has
provided emergent patterns of protein structure.
Recognition of these patterns has also allowed us to assign
the function of a modulator of apoptosis to a Drosophila
gene product of previously unknown function (GenBank
Accession Number M96581). The amino acid sequence of this
protein, termed diap, is shown in Fig. 8 for comparison.

The IAP proteins may be used to inhibit the
apoptosis which occurs as part of disease or disorder
processes. For example, IAP polypeptides or nucleic acid
encoding IAP polypeptides may be administered for the
treatment of or prevention of apoptosis which occurs as a
part of AIDS, neurodegenerative diseases, ischemic injury,
toxin-induced liver disease and myelodysplastic syndromes.
Nucleic acid encoding the IAP polypeptide may also be
provided to inhibit apoptosis.

II. Cloning of IAP Genes

A. XIAP

Our search for human genes potentially involved in
apoptosis has resulted in the identification of an X-linked
sequence tag site (STS) in GenBank which demonstrated
strong homology with the conserved RZF domain of CpIAP and
OpIAP, the two baculovirus genes known to inhibit apoptosis
(Clem et al., Mol. Cell Biol., 14:5212-5222, (1994); and
Birnbaum et al., J. Virol. 68:2521-8, (1994)). Screening a
human fetal brain ZapII cDNA library (Stratagene, La Jolla,
CA) with this STS resulted in the identification and cloning
of xiap (for X-linked inhibitor of apoptosis protein gene).
The human gene has a 1.7 kb coding sequence that includes
three BIR (baculovirus inhibitor of apoptosis repeat; Crook
et al., J. Virol. 67:2168-74, (1993); Clem et al., Science
254:1388-90, (1991); and Birnbaum et al., J. Virol.,
68:2521-8, (1994)) domains and a zinc finger. Northern
analysis with xiap reveals a greater than 7 kb message
expressed in different tissues, particularly liver and kidney
(Fig. 12). The large size of the transcript reflects large
5' and 3' untranslated regions.

B. HUMAN HIAP1 and HIAP2

The hiap1 and hiap2 genes were cloned by screening a
human liver library (Stratagene) with a probe including the
whole xiap coding region at low stringency (40°C wash,
2x SSC, 10% SDS) (Figs. 2 and 3). hiap1 and hiap2 were also
independently detected using a probe derived from an
expressed sequence tag (EST) (GenBank Accession No. T96284)
which includes a portion of a BIR domain. This EST was
originally isolated by the PCR amplification of a cDNA
library using the EST-specific primers. The derived probe
was then used to screen the human liver cDNA library for
full length hiap coding sequences. We have subsequently
detected a third DNA which includes the hiap2 sequence which
appears to lack one exon, presumably due to alternative mRNA
splicing (see boxed region in Fig. 3). Figures 8 and 9 show
hiap1 and hiap2 expression in human tissues as assayed by
Northern analysis.

C. M-XIAP

Screening of a mouse embryo λgt11 cDNA library
(Clontech, Palo Alto, CA) and a mouse FIX II genomic
library with a xiap cDNA clone as probe has resulted in the
identification of 14 positive cDNA and two hybridizing
genomic clones. A cDNA contig spanning 8.0 kb was
constructed using 12 overlapping mouse clones. DNA
sequencing revealed a coding sequence of about 1.7 kb. The
mouse gene, called m-xiap (for mouse X-linked inhibitor of
apoptosis protein gene), shows striking amino acid homology
with xiap at and around the initiation methionine, the stop
codon, the three BIR domains, and the zinc finger domain. As
with the human gene, the mouse homologue contains large 5'
and 3' UTRs predicted to result in a transcript as large as
7-8 kb.

Sequencing and restriction mapping of m-xiap can be
used to further delineate the structure and genomic
organization of m-xiap. Southern blot analysis and the
inverse PCR technique (Groden et al., Cell 66:589-600
(1991)) can be employed to map exons and sequence
exon-intron boundaries.

Antisera can be raised against an m-xiap fusion
protein expressed in Escherichia coli using a bacterial
expression system. The resulting antisera can be used along
with Northern blot analysis to analyze the spatial and
temporal expression of m-xiap in the mouse.

D. M-HIAP1 and M-HIAP2

The murine homologs of hiap1 and hiap2 were cloned
and sequenced in the same general manner as m-xiap using the
human hiap1 and hiap2 sequences as probes. Cloning of
m-hiap1 and m-hiap2 provides a further demonstration of the
ease with which homologs from different species may be
detected and obtained using the techniques provided herein
and those generally known to one skilled in the art of
molecular biology.

III. Cloning of Additional IAP Genes

Low stringency Southern blot hybridization of human
genomic DNA using probes specific for xiap, hiap1, and hiap2
shows bands which correspond to the other known human IAP
sequences. In addition, these probes detect sequences which
do not correspond to known IAP sequences. This result
indicates that additional IAP sequences may be readily
identified using low stringency hybridization. Examples of
murine and human xiap, hiap1, and hiap2 specific primers
which may be used to clone additional genes by RT-PCR are
shown in Table 5. Standard techniques including PCR and
hybridization may be used to clone homologs and additional
genes.

IV. Characterization of IAP Apoptosis Modulating Activity

The apoptosis inhibiting capability of IAPs can be
defined in an in vitro system known to detect alterations in
apoptosis. Mammalian expression constructs carrying IAPs
and their truncated forms can be introduced into various
cell lines such as CHO, NIH 3T3, HL60, Rat-1, or Jurkat
cells, for example. In addition, SF21 insect cells may be
used, in which case the IAP gene is preferentially expressed
using an insect heat shock promoter. Apoptosis will then be
induced in transfected cells and controls employing standard
methodologies (e.g., serum withdrawal and staurosporine). A
survival index (ratio of surviving transfected cells to
surviving control cells) will indicate the strength of each
IAP construct in inhibiting apoptosis. These experiments
can confirm the presence of apoptosis inhibiting or
enhancing activity and can help to determine the minimal
functional region of an IAP. These methods may also be used
in combination with compounds to identify compounds which
modulate apoptosis via their effect on IAP expression.
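
The survival index defined above can be sketched in a few lines of Python. This is only an illustration: the function name, the construct labels, and the cell counts are hypothetical and are not data from the assays described in this specification.

```python
def survival_index(surviving_transfected, surviving_control):
    """Ratio of surviving transfected cells to surviving control cells."""
    if surviving_control <= 0:
        raise ValueError("control count must be positive")
    return surviving_transfected / surviving_control


# Hypothetical viable-cell counts after an apoptotic stimulus.
control_count = 3000
construct_counts = {"construct_A": 8400, "construct_B": 3100}

indices = {
    name: survival_index(count, control_count)
    for name, count in construct_counts.items()
}
```

Under this sketch, an index well above 1 would suggest that a construct protects cells relative to the control, and an index near 1 would suggest little effect.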

Figs. 14A - 14D show specific examples of apoptosis
suppression assays. Fig. 14A shows CHO survival following
serum withdrawal. CHO cells were transfected via
Lipofectace with 2 of each of the following recombinant
plasmids: pCDNA3-6myc-hiap-1, pCDNA3-6myc-hiap-2,
pCDNA3-6myc-xiap, pCDNA3-6myc, pCDNA3-HA-smn, and
pCDNA3-bcl-2. Oligonucleotide primers were synthesized to
allow PCR amplification and cloning of the xiap, hiap-1, and
hiap-2 ORFs in pCDNA3 (Invitrogen). Each construct was
modified to incorporate a synthetic myc tag encoding six
repeats of the peptide sequence MEQKLISEEDL, allowing
detection of myc-IAP fusion proteins via monoclonal anti-myc
antiserum (Egan et al., Nature 363:45-51, 1993). Triplicate
samples of cell lines in 24 well dishes were washed 5 times
with serum free media and maintained in serum free
conditions during the course of the experiment. Trypan blue
exclusion counting of viable cells utilizing a hemocytometer
was performed on samples at time zero, 24 hrs., 48 hrs., and
72 hrs. post serum withdrawal. Survival was calculated as a
percentage of initial numbers. Numbers represent the average
of three separate experiments performed in triplicate, +/-
average deviation. Fig. 14B shows survival of CHO
transfected cell lines following exposure to menadione. Cell
lines were plated in 24 well dishes, allowed to grow
overnight, then exposed for 1.5 hrs. to [20 mM] menadione
(Sigma). Triplicate samples were harvested at the time of
exposure and at 24 hrs. post exposure and assessed by trypan
blue exclusion for survival. Data represent the average of
three independent experiments, +/- average deviation. Fig.
14C shows survival of Rat-1 cells following staurosporine
exposure. Rat-1 cells were transfected with the plasmids
listed for Fig. 14A, with selection in [800 mg/ml] G418
media for two weeks. Cell lines were assessed for resistance
to [1 µM] staurosporine-induced apoptosis for 5 hrs. Viable
cell counts were obtained 24 hrs. post exposure via trypan
blue exclusion counting of samples prepared in triplicate.
Numbers represent the average of two independent
experiments, +/- average deviation. Fig. 14D shows survival
of Rat-1 cell lines tested for resistance to [10 mM]
menadione for 1.5 hrs., then counted at 18 hrs. post
exposure. Numbers represent the average of three experiments
performed in triplicate, +/- average deviation.
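
As a minimal sketch of the arithmetic behind these figures, the following Python computes survival as a percentage of initial numbers and summarizes triplicates as an average +/- average deviation. It assumes that "average deviation" means the mean absolute deviation from the mean; the counts are hypothetical and are not taken from Figs. 14A - 14D.

```python
from statistics import mean


def percent_survival(viable_count, initial_count):
    """Viable cells at a time point as a percentage of the time-zero count."""
    if initial_count <= 0:
        raise ValueError("initial count must be positive")
    return 100.0 * viable_count / initial_count


def average_deviation(values):
    """Mean absolute deviation from the mean (assumed meaning of the term)."""
    values = list(values)
    centre = mean(values)
    return mean([abs(v - centre) for v in values])


# Hypothetical trypan blue counts: one time-zero count, triplicate 24 hr counts.
initial = 50000
viable_24h = [40000, 45000, 35000]

survival = [percent_survival(v, initial) for v in viable_24h]
summary = (mean(survival), average_deviation(survival))
```

With these illustrative counts the triplicate survivals are 80%, 90%, and 70%, which the sketch summarizes as a mean of 80% with an average deviation of about 6.7 percentage points.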

Specific examples of apoptosis assays are also
provided in the following references:

Lymphocyte: C.J. Li et al., "Induction of apoptosis
in uninfected lymphocytes by HIV-1 Tat protein", Science,
268:429-431 (1995); D. Gibellini et al., "Tat-expressing
Jurkat cells show an increased resistance to different
apoptotic stimuli, including acute human immunodeficiency
virus-type 1 (HIV-1) infection", Br. J. Haematol., 89:24-33,
(1995); S.J. Martin et al., "HIV-1 infection of human CD4+ T
cells in vitro. Differential induction of apoptosis in
these cells", J. Immunol., 152:330-42, (1994); C. Terai et
al., "Apoptosis as a mechanism of cell death in cultured T
lymphoblasts acutely infected with HIV-1", J. Clin. Invest.,
87:1710-5, (1991); J. Dhein et al., "Autocrine T-cell
suicide mediated by APO-1/(Fas/CD95)", Nature, 373:438-441,
(1995); P.D. Katsikis et al., "Fas antigen stimulation
induces marked apoptosis of T lymphocytes in human
immunodeficiency virus-infected individuals", J. Exp. Med.,
181:2029-2036, (1995); Michael O. Westendorp et al.,
"Sensitization of T cells to CD95-mediated apoptosis by HIV-1
Tat and gp120", Nature, 375:497, (1995); DeRossi et al.,
Virology, 198:234-44, (1994).

Fibroblasts: H. Vossbeck et al., "Direct
transforming activity of TGF-beta on rat fibroblasts", Int.
J. Cancer, 61:92-97, (1995); S. Goruppi et al., "Dissection
of c-myc domains involved in S phase induction of NIH3T3
fibroblasts", Oncogene, 9:1537-44, (1994); A. Fernandez et
al., "Differential sensitivity of normal and
Ha-ras-transformed C3H mouse embryo fibroblasts to tumor
necrosis factor: induction of bcl-2, c-myc, and manganese
superoxide dismutase in resistant cells", Oncogene,
9:2009-17, (1994); E. A. Harrington et al., "c-Myc-induced
apoptosis in fibroblasts is inhibited by specific
cytokines", Embo J., 13:3286-3295, (1994); N. Itoh et al.,
"A novel protein domain required for apoptosis. Mutational
analysis of human Fas antigen", J. Biol. Chem.,
268:10932-7, (1993).

Neuronal Cells: G. Melino et al., "Tissue
transglutaminase and apoptosis: sense and antisense
transfection studies with human neuroblastoma cells", Mol.
Cell. Biol., 14:6584-6596, (1994); D. M. Rosenbaum et al.,
"Evidence for hypoxia-induced, programmed cell death of
cultured neurons", Ann. Neurol., 36:864-870, (1994); N. Sato
et al., "Neuronal differentiation of PC12 cells as a result
of prevention of cell death by bcl-2", J. Neurobiol.,
25:1227-1234, (1994); G. Ferrari et al., "N-acetylcysteine
(D- and L-stereoisomers) prevents apoptotic death of
neuronal cells", J. Neurosci., 15:2857-2866, (1995); A. K.
Talley et al., "Tumor necrosis factor alpha-induced
apoptosis in human neuronal cells: protection by the
antioxidant N-acetylcysteine and the genes bcl-2 and crmA",
Mol. Cell. Biol., 15:2359-2366, (1995); G. Walkinshaw et
al., "Induction of apoptosis in catecholaminergic PC12 cells
by L-DOPA. Implications for the treatment of Parkinson's
disease", J. Clin. Invest., 95:2458-2464, (1995).

Insect Cells: R. J. Clem et al., "Prevention of
apoptosis by a baculovirus gene during infection of insect
cells", Science, 254:1388-90, (1991); N. E. Crook et al.,
"An apoptosis-inhibiting baculovirus gene with a zinc
finger-like motif", J. Virol., 67:2168-74, (1993); S.
Rabizadeh et al., "Expression of the baculovirus p35 gene
inhibits mammalian neural cell death", J. Neurochem.,
61:2318-21, (1993); M. J. Birnbaum et al., "An
apoptosis-inhibiting gene from a nuclear polyhedrosis virus
encoding a polypeptide with Cys/His sequence motifs", J.
Virol., 68:2521-8, (1994); R. J. Clem et al., "Control of
programmed cell death by the baculovirus genes p35 and iap",
Mol. Cell. Biol., 14:5212-5222, (1994).

V. Construction of a Transgenic Animal

Characterization of IAPs can provide information
that allows for the development of an IAP knockout animal
model, preferably a mammal, most preferably a mouse, by
homologous recombination. Similarly, an IAP overproducing
animal may be produced by means of DNA sequence integration
into the genome.

A replacement type targeting vector to create a
knockout can be constructed using an isogenic genomic clone
from a mouse strain, e.g., 129/Sv (Stratagene, La Jolla, CA).
The targeting vector will be introduced into a J1 line of
embryonic stem (ES) cells by electroporation to generate ES
cell lines that carry a profoundly truncated form of an IAP.
To generate chimeric founder mice, the targeted cell lines
will be injected into a mouse blastula stage embryo.
Heterozygote offspring will be interbred to homozygosity.
Knockout mice may be constructed as a means of screening in
vivo for therapeutic compounds which modulate apoptosis.

Animals having enhanced IAP expression may also be
constructed using standard transgenic technologies.

VI. IAP Protein Expression

IAP genes may be expressed in both prokaryotic and
eukaryotic cell types. For those IAPs which increase
apoptosis it may be desirable to express the protein under
control of an inducible promoter for the purposes of protein
production.

In general, IAP proteins according to the invention
may be produced by transformation of a suitable host cell
with all or part of an IAP-encoding cDNA fragment (e.g., the
cDNA described above) in a suitable expression vehicle.

Those skilled in the field of molecular biology will
understand that any of a wide variety of expression systems
may be used to provide the recombinant protein. The precise
host cell used is not critical to the invention. The IAP
protein may be produced in a prokaryotic host (e.g., E.
coli) or in a eukaryotic host (e.g., Saccharomyces
cerevisiae, insect cells, e.g., Sf21 cells, or mammalian
cells, e.g., COS 1, NIH 3T3, or HeLa cells). Such cells are
available from a wide range of sources (e.g., the American
Type Culture Collection, Rockville, MD; also, see, e.g.,
Ausubel et al., Current Protocols in Molecular Biology, John
Wiley & Sons, New York, 1994). The method of transformation
or transfection and the choice of expression vehicle will
depend on the host system selected. Transformation and
transfection methods are described, e.g., in Ausubel et al.
(supra); expression vehicles may be chosen from those
provided, e.g., in Cloning Vectors: A Laboratory Manual
(P.H. Pouwels et al., 1985, Supp. 1987).

One preferred expression system is the baculovirus
system (using, for example, the vector pBacPAK9) available
from Clontech (Palo Alto, CA). If desired, this system may
be used in conjunction with other protein expression
techniques, for example, the myc tag approach described by
Evan et al. (Mol. Cell Biol. 5:3610-3616, 1985).

Alternatively, an IAP protein is produced by a
stably-transfected mammalian cell line. A number of vectors
suitable for stable transfection of mammalian cells are
available to the public, e.g., see Pouwels et al. (supra);
methods for constructing such cell lines are also publicly
available, e.g., in Ausubel et al. (supra). In one example,
cDNA encoding the IAP protein is cloned into an expression
vector which includes the dihydrofolate reductase (DHFR)
gene. Integration of the plasmid and, therefore, the IAP
protein-encoding gene into the host cell chromosome is
selected for by inclusion of 0.01-300 µM methotrexate in the
cell culture medium (as described in Ausubel et al., supra).
This dominant selection can be accomplished in most cell
types. Recombinant protein expression can be increased by
DHFR-mediated amplification of the transfected gene.
Methods for selecting cell lines bearing gene amplifications
are described in Ausubel et al. (supra); such methods
generally involve extended culture in medium containing
gradually increasing levels of methotrexate.
DHFR-containing expression vectors commonly used for this
purpose include pCVSEII-DHFR and pAdD26SV(A) (described in
Ausubel et al., supra). Any of the host cells described
above or, preferably, a DHFR-deficient CHO cell line (e.g.,
CHO DHFR- cells, ATCC Accession No. CRL 9096) are among the
host cells preferred for DHFR selection of a
stably-transfected cell line or DHFR-mediated gene
amplification.

Once the recombinant IAP protein is expressed, it is
isolated, e.g., using affinity chromatography. In one
example, an anti-IAP protein antibody (e.g., produced as
described herein) may be attached to a column and used to
isolate the IAP protein. Lysis and fractionation of IAP
protein-harboring cells prior to affinity chromatography may
be performed by standard methods (see, e.g., Ausubel et al.,
supra).

Once isolated, the recombinant protein can, if
desired, be further purified, e.g., by high performance
liquid chromatography (see, e.g., Fisher, Laboratory
Techniques In Biochemistry And Molecular Biology, eds., Work
and Burdon, Elsevier, 1980).

Polypeptides of the invention, particularly short
IAP protein fragments, can also be produced by chemical
synthesis (e.g., by the methods described in Solid Phase
Peptide Synthesis, 2nd ed., 1984, The Pierce Chemical Co.,
Rockford, IL).

These general techniques of polypeptide expression
and purification can also be used to produce and isolate
useful IAP fragments or analogs (described herein).

VI. Anti-IAP Antibodies

To generate IAP-specific antibodies, an IAP coding
sequence (i.e., amino acids 180-276) can be expressed as a
C-terminal fusion with glutathione S-transferase (GST)
(Smith et al., Gene 67:31-40, 1988). The fusion protein can
be purified on glutathione-Sepharose beads, eluted with
glutathione, cleaved with thrombin (at the engineered
cleavage site), and purified to the degree necessary for
immunization of rabbits. Primary immunizations can be
carried out with Freund's complete adjuvant and subsequent
immunizations with Freund's incomplete adjuvant. Antibody
titres are monitored by Western blot and immunoprecipitation
analyses using the thrombin-cleaved IAP protein fragment of
the GST-IAP fusion protein. Immune sera are affinity
purified using CNBr-Sepharose-coupled IAP protein.
Antiserum specificity is determined using a panel of
unrelated GST proteins (including GST-p53, Rb, HPV-16 E6,
and E6-AP) and GST-trypsin (which was generated by PCR using
known sequences).

As an alternate or adjunct immunogen to GST fusion
proteins, peptides corresponding to relatively unique
hydrophilic regions of IAP may be generated and coupled to
keyhole limpet hemocyanin (KLH) through an introduced
C-terminal lysine. Antiserum to each of these peptides is
similarly affinity purified on peptides conjugated to BSA,
and specificity tested in ELISA and Western blots using
peptide conjugates, and by Western blot and
immunoprecipitation using IAP expressed as a GST fusion
protein.

Alternatively, monoclonal antibodies may be prepared
using the IAP proteins described above and standard
hybridoma technology (see, e.g., Kohler et al., Nature
256:495, 1975; Kohler et al., Eur. J. Immunol. 6:511, 1976;
Kohler et al., Eur. J. Immunol. 6:292, 1976; Hammerling et
al., In Monoclonal Antibodies and T Cell Hybridomas,
Elsevier, NY, 1981; Ausubel et al., supra). Once produced,
monoclonal antibodies are also tested for specific IAP
recognition by Western blot or immunoprecipitation analysis
(by the methods described in Ausubel et al., supra).
Antibodies which specifically recognize IAP are considered
to be useful in the invention; such antibodies may be used,
e.g., in an immunoassay to monitor the level of IAP produced
by a mammal (for example, to determine the amount or
subcellular location of IAP).

Preferably, antibodies of the invention are produced
using fragments of the IAP protein which lie outside highly
conserved regions and appear likely to be antigenic, by
criteria such as those provided by the Peptidestructure
program of the Genetics Computer Group Sequence Analysis
Package (Program Manual for the GCG Package, Version 7,
1991) using the algorithm of Jameson and Wolf (CABIOS 4:181,
1988). Specifically, these regions, which are found between
BIR1 and BIR2 of all the IAP proteins, are in hiap1 from
amino acid 99 to 170, hiap2 from amino acid 123 to 184, xiap
from 116 to 133, and m-xiap from 116 to 133. In one specific
example, such fragments are generated by standard techniques
of PCR and cloned into the pGEX expression vector (Ausubel
et al., supra). Fusion proteins are expressed in E. coli
and purified using a glutathione agarose affinity matrix as
described in Ausubel et al. (supra). To attempt to minimize
the potential problems of low affinity or specificity of
antisera, two or three such fusions are generated for each
protein, and each fusion is injected into at least two
rabbits. Antisera are raised by injections in a series,
preferably including at least three booster injections.
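
The selection of candidate immunogen fragments can be illustrated with a short Python sketch. It is not the Peptidestructure program or the Jameson and Wolf algorithm: as a simpler stand-in it uses a sliding-window average of the Kyte-Doolittle hydropathy scale, where lower scores indicate more hydrophilic stretches. The residue ranges in `REGIONS` are the ones stated above; the function names, the window size, and the toy sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Inter-BIR regions named in the text (1-based, inclusive residue numbers).
REGIONS = {
    "hiap1": (99, 170),
    "hiap2": (123, 184),
    "xiap": (116, 133),
    "m-xiap": (116, 133),
}


def fragment(sequence, start, end):
    """Return residues start..end of a protein sequence (1-based, inclusive)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


def hydropathy_windows(sequence, window=7):
    """List (1-based start, mean hydropathy) for every full window."""
    scores = []
    for i in range(len(sequence) - window + 1):
        chunk = sequence[i:i + window]
        scores.append((i + 1, sum(KD[aa] for aa in chunk) / window))
    return scores


def most_hydrophilic(sequence, window=7):
    """Window with the lowest mean hydropathy, as (start, score)."""
    return min(hydropathy_windows(sequence, window), key=lambda item: item[1])


# Toy sequence (not an IAP sequence): a lysine-rich stretch flanked by leucines.
toy = "LLLLLLL" + "KKKKKKK" + "LLLLLLL"
best_start, best_score = most_hydrophilic(toy)
```

Given a full-length protein sequence, `fragment(seq, *REGIONS["xiap"])` would return the stretch between BIR1 and BIR2 that the text proposes as an immunogen; the hydropathy scan is only a coarse first pass and does not replace an antigenicity prediction.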

VII. Identification of Molecules that Modulate IAP Protein
Expression

Isolation of the IAP cDNAs also facilitates the
identification of molecules which increase or decrease IAP
expression. According to one approach, candidate molecules
are added at varying concentrations to the culture medium of
cells expressing IAP mRNA. IAP expression is then measured,
for example, by standard Northern blot analysis (Ausubel et
al., supra) using an IAP cDNA (or cDNA fragment) as a
hybridization probe (see also Table 5). The level of IAP
expression in the presence of the candidate molecule is
compared to the level measured for the same cells in the
same culture medium but in the absence of the candidate
molecule.
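
The comparison described above reduces to a ratio of treated to untreated signal at each concentration tested. The Python below is a minimal sketch of that bookkeeping; the two-fold threshold, the function names, and the band intensities are assumptions made for illustration and are not specified in this document.

```python
def relative_expression(treated_signal, untreated_signal):
    """Expression with the candidate molecule relative to without it."""
    if untreated_signal <= 0:
        raise ValueError("untreated signal must be positive")
    return treated_signal / untreated_signal


def classify(ratio, threshold=2.0):
    """Label a ratio using an arbitrary fold-change cut-off (an assumption)."""
    if ratio >= threshold:
        return "increase"
    if ratio <= 1.0 / threshold:
        return "decrease"
    return "no change"


# Hypothetical Northern blot band intensities (arbitrary units) for one
# candidate molecule at three concentrations, against the untreated control.
untreated = 100.0
treated_by_concentration = {"low": 110.0, "medium": 180.0, "high": 310.0}

calls = {
    dose: classify(relative_expression(signal, untreated))
    for dose, signal in treated_by_concentration.items()
}
```

In this illustrative series only the highest concentration crosses the assumed cut-off, which is the kind of dose dependence the varying-concentration design is meant to reveal.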

If desired, the effect of candidate modulators on
expression may, in the alternative, be measured at the level
of IAP protein production using the same general approach
and standard immunological detection techniques, such as
Western blotting or immunoprecipitation with an IAP-specific
antibody (for example, the IAP antibody described herein).

Candidate modulators may be purified (or
substantially purified) molecules or may be one component of
a mixture of compounds (e.g., an extract or supernatant
obtained from cells; Ausubel et al., supra). In a mixed
compound assay, IAP expression is tested against
progressively smaller subsets of the candidate compound pool
(e.g., produced by standard purification techniques, e.g.,
HPLC or FPLC) until a single compound or minimal compound
mixture is demonstrated to modulate IAP expression.

Alternatively, or in addition, candidate compounds
may be screened for those which modulate IAP apoptosis
inhibiting activity. In this approach, the degree of
apoptosis in the presence of a candidate compound is
compared to the degree of apoptosis in its absence, under
equivalent conditions. Again, such a screen may begin with
a pool of candidate compounds, from which one or more useful
modulator compounds are isolated in a step-wise fashion.
Apoptosis activity may be measured by any standard assay,
for example, those described herein.
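
The step-wise narrowing of a candidate pool can be sketched as a halving search. In the Python below, `pool_is_active` is a hypothetical stand-in for the wet-lab assay (it would report whether a given subset modulates IAP expression or apoptosis), and the compound names are invented. The sketch stops early when neither half is active on its own, returning the smallest mixture it could confirm.

```python
def deconvolute(pool, pool_is_active):
    """Narrow an active pool by repeatedly testing each half.

    Returns None if the whole pool is inactive; otherwise returns the
    smallest active subset reached by halving (a single compound, or a
    mixture whose activity is lost when it is split).
    """
    candidates = list(pool)
    if not pool_is_active(candidates):
        return None
    while len(candidates) > 1:
        middle = len(candidates) // 2
        first, second = candidates[:middle], candidates[middle:]
        if pool_is_active(first):
            candidates = first
        elif pool_is_active(second):
            candidates = second
        else:
            # Activity needs components from both halves; stop here.
            break
    return candidates


# Hypothetical pool in which a single compound carries the activity.
pool = ["cpd1", "cpd2", "cpd3", "cpd4", "cpd5", "cpd6", "cpd7", "cpd8"]
active = {"cpd6"}


def assay(subset):
    """Toy assay: a subset is active if it contains an active compound."""
    return any(compound in active for compound in subset)


hit = deconvolute(pool, assay)
```

Halving keeps the number of assays roughly logarithmic in the pool size, though in practice the subsets would be defined by fractionation (for example, HPLC or FPLC fractions) rather than by list position.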

Another method for detecting compounds which
modulate IAP polypeptide activity is to screen for compounds
which physically interact with a given IAP polypeptide.
Such compounds may be detected using adaptations of the
interaction trap expression systems known in the art. Such
systems detect protein interactions using a transcriptional
activation assay and are generally described in Gyuris et
al., Cell 75:791-803 (1993), and Fields and Song, Nature
340:245-246 (1989), and are commercially available from
Clontech (Palo Alto, CA). In addition, PCT Publication WO
95/28497 (hereby incorporated by reference) describes a
method for detecting proteins involved in apoptosis by
virtue of their interaction with Bcl-2 using such an
interaction trap assay. A similar method may be exploited
to identify proteins and other compounds which interact with
the IAP polypeptides.

Candidate IAP modulators include peptide as well as
non-peptide molecules (e.g., peptide or non-peptide
molecules found, e.g., in a cell extract, mammalian serum,
or growth medium on which mammalian cells have been
cultured).

A molecule which promotes an increase in IAP
expression or IAP activity is considered particularly useful
in the invention; such a molecule may be used, for example,
as a therapeutic to increase cellular levels of IAP and
thereby exploit the effect of IAP polypeptides for the
inhibition of apoptosis.

A molecule which decreases IAP activity (e.g., by
decreasing gene expression or polypeptide activity) may be
useful for decreasing cell proliferation. Such uses include
treatment of neoplasms (see Table 3, below) or other cell
proliferative diseases.

Modulators found to be effective at the level of IAP
expression or activity may be confirmed as useful in animal
models and, if successful, may be used as anti-cancer
therapeutics for either the inhibition or the enhancement of
apoptosis, as appropriate.

IX. IAP Therapy

Because expression levels of IAP genes correlate
with the levels of apoptosis, the IAP gene also finds use in
anti-apoptosis gene therapy. In particular, to sustain
neuronal cells, lymphocytes (T-cells and B-cells), or cells
exposed to ischemic injury, a functional IAP gene may be
introduced into cells at the sites predicted to undergo
undesirable apoptosis.

Retroviral vectors, adenoviral vectors,
adeno-associated viral vectors, or other viral vectors with
the appropriate tropism for cells likely to be involved in
apoptosis (for example, epithelial cells) may be used as a
gene transfer delivery system for a therapeutic IAP gene
construct. Numerous vectors useful for this purpose are
generally known (Miller, Human Gene Therapy 1:5-14, 1990;
Friedman, Science 244:1275-1281, 1989; Eglitis and Anderson,
BioTechniques 6:608-614, 1988; Tolstoshev and Anderson,
Current Opinion in Biotechnology 1:55-61, 1990; Sharp, The
Lancet 337:1277-1278, 1991; Cornetta et al., Nucleic Acid
Research and Molecular Biology 36:311-322, 1987; Anderson,
Science 226:401-409, 1984; Moen, Blood Cells 17:407-416,
1991; Miller and Rosman, Biotechniques 7:980-990, 1989;
Le Gal La Salle et al., Science 259:988-990, 1993; and
Johnson, Chest 107:77S-83S, 1995). Retroviral vectors are
particularly well developed and have been used in clinical
settings (Rosenberg et al., N. Engl. J. Med. 323:370, 1990;
Anderson et al., U.S. Pat. No. 5,399,346).

Non-viral approaches may also be employed for the
introduction of therapeutic DNA into cells otherwise
predicted to undergo apoptosis. For example, IAP may be
introduced into a neuronal cell or a T-cell by the
techniques of lipofection (Felgner et al., Proc. Natl. Acad.
Sci. USA 84:7413, 1987; Ono et al., Neuroscience Lett.
117:259, 1990; Brigham et al., Am. J. Med. Sci. 298:278,
1989; Straubinger and Papahadjopoulos, Meth. Enz. 101:512,
1983); asialoorosomucoid-polylysine conjugation (Wu and Wu,
J. Biol. Chem. 263:14621, 1988; Wu et al., J. Biol. Chem.
264:16985, 1989); or, less preferably, microinjection under
surgical conditions (Wolff et al., Science 247:1465, 1990).

For any of the above approaches, the therapeutic IAP
DNA construct is preferably applied to the site of the
predicted apoptosis event (for example, by injection), but
may also be applied to tissue in the vicinity of the
predicted apoptosis event or even to a blood vessel
supplying the cells predicted to undergo apoptosis.

In the gene therapy constructs, IAP cDNA expression
is directed from any suitable promoter (e.g., the human
cytomegalovirus, simian virus 40, or metallothionein
promoters), and its production is regulated by any desired
mammalian regulatory element. For example, if desired,
enhancers known to direct preferential gene expression in
neural cells or T-cells may be used to direct IAP
expression. Such enhancers include, without limitation,
those enhancers which are characterized as tissue or cell
specific in their expression.

Alternatively, if an IAP genomic clone is utilized as
a therapeutic construct (for example, following its
isolation by hybridization with the IAP cDNA described
above), IAP expression is regulated by its cognate
regulatory sequences or, if desired, by regulatory sequences
derived from a heterologous source, e.g., any of the
promoters or regulatory elements described above.

Less preferably, IAP gene therapy is accomplished by
direct administration of the IAP mRNA to a cell predicted to
undergo apoptosis. This mRNA may be produced and isolated
by any standard technique, but is most readily produced by
in vitro transcription using an IAP cDNA under the control of
a high efficiency promoter (e.g., the T7 promoter).
Administration of IAP mRNA to malignant cells is carried out
by any of the methods for direct nucleic acid administration
described above.

Ideally, the production of IAP protein by any gene
therapy approach described above results in a cellular level
of IAP that is at least equivalent to the normal, cellular
level of IAP in an unaffected individual. Treatment by any
IAP-mediated gene therapy approach may be combined with more
traditional therapies.

Another therapeutic approach included within the
invention involves direct administration of recombinant IAP
protein, either to the site of a predicted apoptosis event
(for example, by injection) or systemically by any
conventional recombinant protein administration technique.
The actual dosage of IAP depends on a number of factors,
including the size and health of the individual patient,
but, generally, between 0.1 mg and 100 mg inclusive are
administered per day to an adult in any
pharmaceutically-acceptable formulation.



Administration of IAP polypeptides, IAP genes, or
modulators of IAP synthesis or function

An IAP protein, gene, or modulator may be
administered with a pharmaceutically-acceptable diluent,
carrier, or excipient, in unit dosage form. Conventional
pharmaceutical practice may be employed to provide suitable
formulations or compositions to administer IAP to patients
suffering from or presymptomatic for an IAP-associated
carcinoma. Any appropriate route of administration may be
employed, for example, parenteral, intravenous,
subcutaneous, intramuscular, intracranial, intraorbital,
ophthalmic, intraventricular, intracapsular, intraspinal,
intracisternal, intraperitoneal, intranasal, aerosol, or
oral administration. Therapeutic formulations may be in the
form of liquid solutions or suspensions; for oral
administration, formulations may be in the form of tablets
or capsules; and for intranasal formulations, in the form of
powders, nasal drops, or aerosols.

Methods well known in the art for making
formulations are found in, for example, "Remington's
Pharmaceutical Sciences." Formulations for parenteral
administration may, for example, contain excipients, sterile
water, or saline, polyalkylene glycols such as polyethylene
glycol, oils of vegetable origin, or hydrogenated
naphthalenes. Biocompatible, biodegradable lactide polymer,
lactide/glycolide copolymer, or
polyoxyethylene-polyoxypropylene copolymers may be used to
control the release of the compounds. Other potentially
useful parenteral delivery systems for IAP modulatory
compounds include ethylene-vinyl acetate copolymer
particles, osmotic pumps, implantable infusion systems, and
liposomes. Formulations for inhalation may contain
excipients, for example, lactose, or may be aqueous
solutions containing, for example, polyoxyethylene-9-lauryl
ether, glycocholate and deoxycholate, or may be oily
solutions for administration in the form of nasal drops, or
as a gel.

If desired, treatment with an IAP protein, gene, or
modulatory compound may be combined with more traditional
therapies for the disease such as surgery, radiation, or
chemotherapy for cancers; surgery, steroid therapy, and
chemotherapy for autoimmune diseases; antiviral therapies
for AIDS; and, for example, TPA for ischemic injury.

XI. Detection of A Condition Involving Altered Apoptosis

IAP polypeptides and nucleic acid sequences find
diagnostic use in the detection or monitoring of conditions
involving aberrant levels of apoptosis. For example,
decreased expression of IAP may be correlated with enhanced
apoptosis in humans (see XII, below). Accordingly, a
decrease or increase in the level of IAP production may
provide an indication of a deleterious condition. Levels of
IAP expression may be assayed by any standard technique.
For example, its expression in a biological sample (e.g., a
biopsy) may be monitored by standard Northern blot analysis
or may be aided by PCR (see, e.g., Ausubel et al., supra;
PCR Technology: Principles and Applications for DNA
Amplification, ed., H.A. Ehrlich, Stockton Press, NY; and
Yap and McGee, Nucl. Acids. Res. 19:4294, 1991).
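
As a rough illustration of comparing an IAP expression level in a sample against a control, the following Python sketch computes a log2 fold change and labels the sample as increased, decreased, or unchanged. The function names, the signal values, and the two-fold cut-off are hypothetical choices made for this example; the specification does not prescribe a numerical threshold.

```python
import math


def log2_fold_change(sample_signal: float, control_signal: float) -> float:
    """Return log2(sample / control) for two positive band intensities."""
    if sample_signal <= 0 or control_signal <= 0:
        raise ValueError("signals must be positive")
    return math.log2(sample_signal / control_signal)


def classify_expression(sample_signal: float, control_signal: float,
                        threshold: float = 1.0) -> str:
    """Label expression relative to control.

    threshold is in log2 units; 1.0 (a two-fold change) is an assumed
    cut-off chosen only for illustration.
    """
    change = log2_fold_change(sample_signal, control_signal)
    if change >= threshold:
        return "increased"
    if change <= -threshold:
        return "decreased"
    return "unchanged"


# Hypothetical densitometry readings (arbitrary units).
for sample, control in [(400.0, 100.0), (30.0, 120.0), (110.0, 100.0)]:
    print(sample, control, classify_expression(sample, control))
```

In practice the signals would first be normalized to a loading control, a step this sketch leaves out.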

Alternatively, a patient sample may be analyzed for
one or more mutations in the IAP sequences using a mismatch
detection approach. Generally, these techniques involve PCR
amplification of nucleic acid from the patient sample,
followed by identification of the mutation (i.e., mismatch)
by either altered hybridization, aberrant electrophoretic
gel migration, binding or cleavage mediated by mismatch
binding proteins, or direct nucleic acid sequencing. Any of
these techniques may be used to facilitate mutant IAP
detection, and each is well known in the art; examples of
particular techniques are described, without limitation, in
Orita et al., Proc. Natl. Acad. Sci. USA 86:2766-2770,
(1989); and Sheffield et al., Proc. Natl. Acad. Sci. USA
86:232-236, (1989).
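
For the direct sequencing route, the comparison step can be sketched in a few lines of Python: an amplified patient sequence is compared with a reference sequence position by position and each point mismatch is reported. The function name `find_point_mutations` and the short example sequences are hypothetical; the sketch assumes the two sequences are already aligned and of equal length, so insertions and deletions are out of scope.

```python
def find_point_mutations(reference: str, sample: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, reference base, sample base) per mismatch."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be pre-aligned to equal length")
    ref, smp = reference.upper(), sample.upper()
    return [(i + 1, r, s) for i, (r, s) in enumerate(zip(ref, smp)) if r != s]


# Hypothetical 12-nucleotide amplicon carrying one substitution.
reference = "ATGACTTTTAAC"
patient = "ATGAGTTTTAAC"
print(find_point_mutations(reference, patient))
```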

In yet another approach, immunoassays are used to
detect or monitor IAP protein in a biological sample.
IAP-specific polyclonal or monoclonal antibodies (produced as
described above) may be used in any standard immunoassay
format (e.g., ELISA, Western blot, or RIA assay) to measure
IAP polypeptide levels; again comparison is to wild-type IAP
levels, and a decrease in IAP production is indicative of a
condition involving increased apoptosis. Examples of
immunoassays are described, e.g., in Ausubel et al., supra.
Immunohistochemical techniques may also be utilized for IAP
detection. For example, a tissue sample may be obtained
from a patient, and a section stained for the presence of
IAP using an anti-IAP antibody and any standard detection
system (e.g., one which includes a secondary antibody
conjugated to horseradish peroxidase). General guidance
regarding such techniques can be found in, e.g., Bancroft
and Stevens (Theory and Practice of Histological Techniques,
Churchill Livingstone, 1982) and Ausubel et al. (supra).
In one preferred example, a combined diagnostic
method may be employed that begins with an evaluation of IAP
protein production (for example, by immunological techniques
or the protein truncation test (Hogervorst, F.B.L., et al.,
Nature Genetics 10:208-212 (1995))) and also includes a
nucleic acid-based detection technique designed to identify
more subtle IAP mutations (for example, point mutations).
As described above, a number of mismatch detection assays
are available to those skilled in the art, and any preferred
technique may be used (see above). By this approach,
mutations in IAP may be detected that either result in loss
of IAP expression or loss of IAP biological activity. In a
variation of this combined diagnostic method, IAP biological
activity is measured as protease activity using any
appropriate protease assay system (for example, those
described above).

Mismatch detection assays also provide the
opportunity to diagnose an IAP-mediated predisposition to
diseases of apoptosis. For example, a patient heterozygous
for an IAP mutation may show no clinical symptoms and yet
possess a higher than normal probability of developing one
or more types of neurodegenerative, myelodysplastic or
ischemic diseases. Given this diagnosis, a patient may take
precautions to minimize their exposure to adverse
environmental factors (for example, UV exposure or chemical
mutagens) and to carefully monitor their medical condition
(for example, through frequent physical examinations). This
type of IAP diagnostic approach may also be used to detect
IAP mutations in prenatal screens.

The IAP diagnostic assays described above may be
carried out using any biological sample (for example, any
biopsy sample or bodily fluid or tissue) in which IAP is
normally expressed (for example, the inhibition of
apoptosis). Identification of a mutant IAP gene may also be
assayed using these sources for test samples.
Alternatively, an IAP mutation, particularly as part of a
diagnosis for predisposition to IAP-associated degenerative
disease, may be tested using a DNA sample from any cell, for
example, by mismatch detection techniques; preferably, the
DNA sample is subjected to PCR amplification prior to
analysis.

To demonstrate the utility of IAP gene sequences as
diagnostics and prognostics for cancer, we probed the
Clontech (La Jolla) Human Cancer Cell Line Multiple Tissue
Northern Blot (#7757-1). As Table 3 shows, all cancer lines
tested showed increased IAP expression relative to samples
from non-cancerous control cell lines. xiap expression was
particularly high in HeLa (S-3), chronic myelogenous
leukemia (K-562), colorectal adenocarcinoma (SW-480) and
melanoma (G-361) lines. hiap1 expression was extremely high
in Burkitt's lymphoma and was also elevated in colorectal
adenocarcinoma. hiap2 expression was particularly high in
chronic myelogenous leukemia (K-562) and colorectal
adenocarcinoma (SW-480).

In addition, we note that we have mapped hiap1 and
hiap2 to human chromosome 11q23. This is a known hotspot
for cancer causing mutations.

TABLE 3

Northern Blot IAP RNA levels in Cancer Cells*

                                      xiap    hiap1     hiap2
Promyelocytic Leukemia HL-60          +       +         +
HeLa S-3                              +       +         +
Chronic Myelogenous Leukemia K-562    +++     +         +++
Lymphoblastic Leukemia MOLT-4         +++               +
Burkitt's Lymphoma Raji               +       +(x10)
Colorectal Adenocarcinoma SW-480      +++     +++       +++
Lung Carcinoma A-549                          +         +
Melanoma G-361                        +++     +         +

* Levels are indicated by a (+) and are the approximate
increase in RNA levels relative to Northern blots of RNA
from non-cancerous control cell lines. A single plus
indicates an estimated increase of at least 1-fold.



XII. Treatment of HIV Infected Individuals

We have found that hiap1 and hiap2 expression is
decreased significantly in HIV infected human cells. This
decrease precedes apoptosis. The result indicates that
administration of HIAP1, HIAP2, genes encoding these
proteins, or compounds which upregulate these genes can be
used to prevent T-cell attrition in HIV infected patients.
The following assay may also be used to screen for compounds
which alter hiap1 and hiap2 expression and which also
prevent apoptosis.

The experiments were performed as follows: Cultured
mature lymphocyte CD4+ T-cell lines (H9 labelled "a";
CEM/CM-3 labelled "b"; 6T-CEM labelled "c"; and Jurkat
labelled "d" in Figs. 13A and 13B) were examined for
apoptosis (Fig. 13A) and hiap gene expression (Fig. 13B).
Control conditions are labelled as lane 1 in Fig. 13A and
Fig. 13B. Lane 2 shows the result 24 hours after PHA/PMA
(phytohemagglutinin, phorbol ester) mitogen stimulation.
Lane 3 shows the result 24 hours after HIV strain IIIB
infection. The "M" refers to standard DNA markers, the 123
bp ladder (Gibco-BRL) in Fig. 13B, and lambda HindIII ladder
(Gibco-BRL) in Fig. 13A.

Fig. 13A is a picture of an ethidium bromide stained
gel showing the presence of DNA ladders (as assayed by the
test of Prigent et al., J. of Immun. Methods, 160:139-140,
(1993)), indicative of apoptosis. The sensitivity and degree
of apoptosis of the four T-cell lines vary following
mitogen stimulation and HIV infection.

For the experiment examining hiap gene expression,
total RNA was prepared from the cultured cells and subjected
to a reverse transcriptase reaction using oligo-dT priming.
The RT cDNA products were PCR amplified using specific
primers (as shown in Table 4) for the detection of hiap2a,
hiap2b and hiap1. PCR conditions were routine (94°C
melting for 1 minute, 55°C annealing for 2 minutes and 72°C
extension for 1.5 minutes for 35 cycles) using a
Perkin-Elmer 480 thermocycler. Fig. 13B shows a picture of the
RT-PCR products run on a 1% agarose gel stained with
ethidium bromide. Absence of hiap2 transcripts is noted in
all four cell lines 24 hours after HIV infection. In three
of four cell lines (all except H9), the hiap1 gene is also
dramatically down-regulated after HIV infection. PHA/PMA
mitogen stimulation also appears to decrease hiap gene
expression, particularly for hiap2 and to a lesser extent,
for hiap1.
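
To make the stated cycling conditions concrete, the following Python sketch encodes the three steps (94°C for 1 minute, 55°C for 2 minutes, 72°C for 1.5 minutes) and the 35-cycle count, then totals the programmed hold time. The `PcrStep` class and the function name are hypothetical conveniences; ramp times between temperatures and any initial or final holds are not given in the text and are therefore left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PcrStep:
    name: str
    temp_c: float
    minutes: float


# Step values are those stated in the text; the structure is illustrative.
PROGRAM = [
    PcrStep("melting", 94.0, 1.0),
    PcrStep("annealing", 55.0, 2.0),
    PcrStep("extension", 72.0, 1.5),
]
CYCLES = 35


def total_hold_minutes(steps: list[PcrStep], cycles: int) -> float:
    """Sum of programmed hold times over all cycles (ramp times excluded)."""
    return cycles * sum(step.minutes for step in steps)


print(total_hold_minutes(PROGRAM, CYCLES))  # 157.5
```

The programmed holds alone therefore come to a little over two and a half hours per run.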

The data from these experiments are summarized in the
accompanying Table 5. The β-actin gene expression was
consistent in all cell lines tested, indicating that a flaw
in the RT-PCR assay does not account for the decreases in
hiap gene expression.

Table 4

Oligonucleotide primers for the specific RT-PCR
amplification of unique IAP genes.

IAP Gene   Forward Primer            Reverse Primer            Product
           (nucleotide position*)    (nucleotide position*)    Size (bp)
h-xiap     p2415 (876-896)           p2449 (1291-1311)         435
m-xiap     p2566 (458-478)           p2490 (994-1013)          555
h-hiap1    p2465 (827-847)           p2464 (1008-1038)         211
m-hiap1    p2687 (747-767)           p2684 (1177-1197)         450
hiap2      p2595 (1562-1585)         p2578 (2339-2363)         801 a
                                                               618 b
m-hiap2    p2693 (1751-1772)         p2734 (2078-2100)         349

* Nucleotide position as determined from Figs. 1-4 for each IAP gene
a PCR product size of hiap2a
b PCR product size of hiap2b
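
The product sizes listed for the single-product primer pairs can be checked against the primer coordinates. The Python sketch below is an illustrative consistency check, not part of the described method: it takes the span from the first nucleotide of the forward primer to the last nucleotide of the reverse primer and compares it with the listed size. The dictionary and function names are hypothetical, and the hiap2 pair is omitted because it yields two products (hiap2a and hiap2b).

```python
# (forward start, forward end), (reverse start, reverse end), listed size
PRIMER_PAIRS = {
    "h-xiap": ((876, 896), (1291, 1311), 435),
    "m-xiap": ((458, 478), (994, 1013), 555),
    "h-hiap1": ((827, 847), (1008, 1038), 211),
    "m-hiap1": ((747, 767), (1177, 1197), 450),
    "m-hiap2": ((1751, 1772), (2078, 2100), 349),
}


def primer_span(forward: tuple[int, int], reverse: tuple[int, int]) -> int:
    """Distance from forward-primer start to reverse-primer end."""
    return reverse[1] - forward[0]


for gene, (fwd, rev, listed) in PRIMER_PAIRS.items():
    span = primer_span(fwd, rev)
    print(f"{gene}: span {span}, listed {listed}, match {span == listed}")
```

The listed sizes agree with the simple coordinate difference; an inclusive count of nucleotides would be one larger in each case.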




Table 5

Apoptosis and hiap gene expression in cultured T-cells
following mitogen stimulation or HIV infection.

Cell Line   Condition            Apoptosis   hiap1   hiap2
H9          not stimulated                           +/-
            PHA/PMA stimulated   +++                 +/-
            HIV infected         ++
CEM/CM-3    not stimulated                   +
            PHA/PMA stimulated   +/-         +
            HIV infected         +/-
6T-CEM      not stimulated                   +       +
            PHA/PMA stimulated   +/-
            HIV infected         +
Jurkat      not stimulated                   +       ++
            PHA/PMA stimulated   +           +       +
            HIV infected         +/-







XIII. Preventive Anti-Apoptotic Therapy

In a patient diagnosed to be heterozygous for an IAP
mutation or to be susceptible to IAP mutations (even if
those mutations do not yet result in alteration or loss of
IAP biological activity), or a patient diagnosed as HIV
positive, any of the above therapies may be administered
before the occurrence of the disease phenotype. For
example, the therapies may be provided to a patient who is
HIV positive but does not yet show a diminished T-cell count
or other signs of full-blown AIDS. In particular, compounds
shown to increase IAP expression or IAP biological activity
may be administered by any standard dosage and route of
administration (see above). Alternatively, gene therapy
using an IAP expression construct may be undertaken to
reverse or prevent the cell defect prior to the development
of the degenerative disease.

The methods of the instant invention may be used to
reduce or diagnose the disorders described herein in any
mammal, for example, humans, domestic pets, or livestock.
Where a non-human mammal is treated or diagnosed, the IAP
polypeptide, nucleic acid, or antibody employed is
preferably specific for that species.

Other Embodiments

In other embodiments, the invention includes any
protein which is substantially identical to a mammalian IAP
polypeptide (Figs. 1-6; SEQ ID NOS:1-42); such homologs
include other substantially pure naturally-occurring
mammalian IAP proteins as well as allelic variants; natural
mutants; induced mutants; DNA sequences which encode
proteins and also hybridize to the IAP DNA sequences of
Figs. 1-6 (SEQ ID NOS:1-42) under high stringency conditions
or, less preferably, under low stringency conditions (e.g.,
washing at 2X SSC at 40°C with a probe length of at least 40
nucleotides); and proteins specifically bound by antisera
directed to an IAP polypeptide. The term also includes
chimeric polypeptides that include an IAP portion.

The invention further includes analogs of any
naturally-occurring IAP polypeptide. Analogs can differ
from the naturally-occurring IAP protein by amino acid
sequence differences, by post-translational modifications,
or by both. Analogs of the invention will generally exhibit
at least 85%, more preferably 90%, and most preferably 95%
or even 99% identity with all or part of a
naturally-occurring IAP amino acid sequence. The length of sequence
comparison is at least 15 amino acid residues, preferably at
least 25 amino acid residues, and more preferably more than
35 amino acid residues. Modifications include in vivo and
in vitro chemical derivatization of polypeptides, e.g.,
acetylation, carboxylation, phosphorylation, or
glycosylation; such modifications may occur during
polypeptide synthesis or processing or following treatment
with isolated modifying enzymes. Analogs can also differ
from the naturally-occurring IAP polypeptide by alterations
in primary sequence. These include genetic variants, both
natural and induced (for example, resulting from random
mutagenesis by irradiation or exposure to
ethanemethylsulfate or by site-specific mutagenesis as
described in Sambrook, Fritsch and Maniatis, Molecular
Cloning: A Laboratory Manual (2d ed.), CSH Press, 1989, or
Ausubel et al., supra). Also included are cyclized
peptides, molecules, and analogs which contain residues
other than L-amino acids, e.g., D-amino acids or
non-naturally occurring or synthetic amino acids, e.g., β or γ
amino acids.
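
The identity and comparison-length criteria stated above (at least 85% identity over at least 15 residues) can be illustrated with a small Python sketch. It assumes two already aligned, ungapped sequences of equal length; the function names and the variant sequence are hypothetical, and the first sequence is simply the first 20 residues of the human xiap polypeptide as written in one-letter code.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def meets_analog_criteria(a: str, b: str,
                          min_identity: float = 85.0,
                          min_length: int = 15) -> bool:
    """Apply the identity and comparison-length thresholds from the text."""
    return len(a) >= min_length and percent_identity(a, b) >= min_identity


native = "MTFNSFEGSKTCVPADINKE"   # residues 1-20 of human xiap
variant = "MTFNAFEGSKTCVPADINRE"  # hypothetical analog, two substitutions

print(percent_identity(native, variant))                           # 90.0
print(meets_analog_criteria(native, variant))                      # True
print(meets_analog_criteria(native, variant, min_identity=95.0))   # False
```

A real comparison would normally use a gapped alignment program rather than position-by-position matching.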

In addition to full-length polypeptides, the
invention also includes IAP polypeptide fragments. As used
herein, the term "fragment" means at least 20 contiguous
amino acids, preferably at least 30 contiguous amino acids,
more preferably at least 50 contiguous amino acids, and most
preferably at least 60 to 80 or more contiguous amino acids.
Fragments of IAP polypeptides can be generated by methods
known to those skilled in the art or may result from normal
protein processing (e.g., removal of amino acids from the
nascent polypeptide that are not required for biological
activity or removal of amino acids by alternative mRNA
splicing or alternative protein processing events).
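
The definition of a fragment as a run of contiguous amino acids of a minimum length can be pictured with a short Python sketch that lists every contiguous window of a chosen length. The function name is hypothetical, and the input is only the first 32 residues of the human xiap polypeptide in one-letter code, used here as a small worked example.

```python
def contiguous_fragments(sequence: str, length: int = 20) -> list[str]:
    """Return every contiguous window of the given length."""
    if length <= 0:
        raise ValueError("length must be positive")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


xiap_n_terminus = "MTFNSFEGSKTCVPADINKEEEFVEEFNRLKT"  # residues 1-32
fragments = contiguous_fragments(xiap_n_terminus, length=20)
print(len(fragments))   # 13
print(fragments[0])     # MTFNSFEGSKTCVPADINKE
```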

Preferable fragments or analogs according to the
invention are those which facilitate specific detection of
an IAP nucleic acid or amino acid sequence in a sample to be
diagnosed. Particularly useful IAP fragments for this
purpose include, without limitation, the amino acid
fragments shown in Table 2.

All publications and patent applications mentioned
in this specification are herein incorporated by reference
to the same extent as if each independent publication or
patent application was specifically and individually
indicated to be incorporated by reference.

What is claimed is:

Claims



1. Substantially pure nucleic acid encoding an IAP
polypeptide.

2. The nucleic acid of claim 1, wherein said
polypeptide comprises a ring zinc finger domain and at least
one BIR domain.

3. The nucleic acid of claim 2, wherein said
polypeptide has at least two BIR domains.

4. The nucleic acid of claim 3, wherein said
polypeptide has at least three BIR domains.

5. The nucleic acid of claim 1, wherein said DNA
contains the xiap gene.

6. The nucleic acid of claim 1, wherein said DNA
contains the hiap2 gene.

7. The nucleic acid of claim 1, wherein said DNA
contains the hiap1 gene.

8. The nucleic acid of claim 1, wherein said
nucleic acid is genomic DNA.

9. The nucleic acid of claim 1, wherein said
nucleic acid is cDNA.

10. The nucleic acid of claim 1, wherein said
nucleic acid is mammalian DNA.

11. The nucleic acid of claim 10, wherein said
mammalian DNA is human DNA.

12. The nucleic acid of claim 10, wherein said
mammalian DNA is murine DNA.

13. Substantially pure DNA having the sequence of
Fig. 5, or degenerate variants thereof, and encoding the
amino acid sequence of Fig. 5.

14. Substantially pure DNA having the sequence of
Fig. 6, or degenerate variants thereof, and encoding the
amino acid sequence of Fig. 6.

15. Substantially pure DNA having about 50% or
greater nucleotide sequence identity to the DNA sequence of
Fig. 5.

16. Substantially pure DNA having about 50% or
greater nucleotide sequence identity to the DNA sequence of
Fig. 6.

17. A purified DNA sequence substantially identical
to the DNA sequence shown in Fig. 5.

18. A purified DNA sequence substantially identical
to the DNA sequence shown in Fig. 6.

19. A substantially pure mammalian IAP polypeptide.

20. The polypeptide of claim 19, wherein said
polypeptide is the murine HIAP1 polypeptide.

21. The polypeptide of claim 19, wherein said
polypeptide is the murine HIAP2 polypeptide.

22. The polypeptide of claim 19, comprising an
amino acid sequence substantially identical to an amino acid
sequence shown in Fig. 5.

23. The polypeptide of claim 19, comprising an
amino acid sequence substantially identical to an amino acid
sequence shown in Fig. 6.

24. A therapeutic composition comprising as an
active ingredient an IAP polypeptide according to claim 19,
said active ingredient being formulated in a physiologically
acceptable carrier.

25. A method of inhibiting apoptosis in a mammal,
said method comprising:
providing a cell of said mammal with a transgene
encoding an IAP polypeptide, said DNA positioned for
expression in said cell.

26. The method of claim 25 wherein said polypeptide
is murine HIAP1.

27. The method of claim 25 wherein said polypeptide
is murine HIAP2.

28. A method of detecting an IAP gene in an animal
cell, said method comprising:
contacting the DNA of claim 13 or a portion thereof
greater than about 18 nucleic acids in length with a
preparation of genomic DNA from said animal cell under
hybridization conditions providing detection of DNA
sequences having about 50% or greater nucleotide sequence
identity to the sequence of Fig. 5.

29. A method of detecting an IAP gene in an animal
cell, said method comprising:
contacting the DNA of claim 14 or a portion thereof
greater than about 18 nucleic acids in length with a
preparation of genomic DNA from said animal cell under
hybridization conditions providing detection of DNA
sequences having about 50% or greater nucleotide sequence
identity to the sequence of Fig. 6.

30. A method of producing an IAP polypeptide
comprising:
providing a cell transformed with DNA encoding an
IAP polypeptide positioned for expression in said cell;
culturing said transformed cell under conditions for
expressing said DNA; and
isolating said IAP polypeptide.

31. The method of claim 30, wherein said IAP
polypeptide is murine HIAP1.

32. The method of claim 30, wherein said IAP
polypeptide is murine HIAP2.

33. A method of identifying a compound which
modulates apoptosis, said method comprising (a) providing a
cell expressing an IAP polypeptide; and (b) contacting said
cell with a candidate compound and monitoring the expression
of an IAP gene, an alteration in the level of expression of
said gene indicating the presence of a compound which
modulates apoptosis.

34. The method of claim 33, wherein said IAP gene
is murine HIAP1.

35. The method of claim 33, wherein said IAP gene
is murine HIAP2.

36. A method for detecting a protein that interacts
with an IAP polypeptide comprising the steps of:
a. contacting under suitable conditions an IAP
protein with a compound suspected to be a modulator of
apoptosis; and
b. detecting the interaction of said compound with
said IAP polypeptide, wherein said interaction indicates
that said compound is involved in the modulation of
apoptosis.

37. The method of claim 36, wherein said IAP
polypeptide is HIAP1.

38. The method of claim 36, wherein said IAP
polypeptide is HIAP2.

39. The method of claim 36, wherein said IAP
polypeptide is XIAP.

40. The method of claim 36, wherein said
interaction is detected by measuring the transcriptional
activity of a reporter gene.

41. The method of claim 36, wherein said
interaction occurs in a yeast cell.

42. The method of claim 36, wherein said compound
is a polypeptide.

43. The method of claim 42, wherein said
polypeptide is expressed from a recombinant nucleic acid.

44. A method of diagnosing an increased likelihood
of a cell proliferative disease in a patient, said method
comprising detecting the level of IAP gene expression in
said patient.

45. A method of diagnosing an increased likelihood
of a cell proliferative disease in a patient, said method
comprising detecting the level of IAP polypeptide activity
in said patient.

46. A transgenic rodent having a knockout mutation
in an IAP gene.

47. A transgenic rodent, said rodent having
additional copies of IAP nucleic acids added to its genome.
MAMMALIAN IAP GENE FAMILY, PRIMERS, PROBES,
AND DETECTION METHODS 
ABSTRACT OF THE DISCLOSURE 
Disclosed is substantially pure DNA encoding 
mammalian IAP polypeptides; substantially pure polypeptides; 
and methods of using such DNA to express the IAP 
polypeptides in cells and animals to inhibit apoptosis. 
Also disclosed are conserved regions characteristic of the 
IAP family and primers and probes for the identification and 
isolation of additional IAP genes. In addition, methods for 
treating diseases and disorders involving apoptosis are 
provided. 
[Figure: amino acid sequence alignment of cp-iap, diap, m-xiap,
xiap, hiap1, and hiap2 with a consensus line. Sequence labels on
the sheet: SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:10, SEQ ID NO:4,
SEQ ID NO:6, SEQ ID NO:8, and SEQ ID NO:44. The BIR1, BIR2, and
BIR3 regions are marked above the alignment.]
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My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
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which a patent is sought on the invention entitled MAMMALIAN IAP GENE FAMILY, PRIMERS, 
PROBES AND DETECTIONS METHODS, the specification of which 

□ is attached hereto. 
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□ was described and claimed in PCT International Application No. 

filed on and as amended under PCT Article 19 on 
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SEQUENCE LISTING 

<110> Korneluk, Robert G. 

Mackenzie, Alexander E.
Baird, Stephen
Liston, Peter

<120> MAMMALIAN IAP GENE FAMILY, PRIMERS,
PROBES, AND DETECTION METHODS

<130> 07891/003005

<150> 08/576,956
<151> 1995-12-22

<150> 08/511,485
<151> 1995-08-04

<160> 92

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 46
<212> PRT
<213> artificial sequence based on Homo sapiens, Mus musculus, Cydia
pomonella, Orgyia pseudotsugata, and Drosophila melanogaster.

<220>
<221> VARIANT
<222> 8
<223> Glu or Asp

<221> VARIANT
<222> 14,22
<223> Val or Ile

<221> VARIANT
<222> (1)...(46)
<223> Xaa = Any Amino Acid

<400> 1
Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Lys Xaa Cys Met
1               5                   10                  15
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Pro Cys Gly His Xaa Xaa Xaa
            20                  25                  30
Cys Xaa Xaa Cys Ala Xaa Xaa Xaa Xaa Xaa Cys Pro Xaa Cys
        35                  40                  45



<210> 2 
<211> 68 
<212> PRT 

<213> artificial sequence based on Homo sapiens, Mus musculus, Cydia 
pomonella, Orgyia pseudotsugata, and Drosophila melanogaster.

<220>
<221> VARIANT
<222> 13,16,17
<223> any amino acid or absent

<221> VARIANT
<222> (1)...(68)
<223> Xaa = Any Amino Acid

<400> 2
Xaa Xaa Xaa Arg Leu Xaa Thr Phe Xaa Xaa Trp Pro Xaa Xaa Xaa Xaa
1               5                   10                  15
Xaa Xaa Xaa Xaa Xaa Leu Ala Xaa Ala Gly Phe Tyr Tyr Xaa Gly Xaa
            20                  25                  30
Xaa Asp Xaa Val Xaa Cys Phe Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Trp
        35                  40                  45
Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Pro Xaa
    50                  55                  60
Cys Xaa Phe Val
65


<210> 3 

<211> 2540 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (2540) 
<223> n = A,T,C, or G 



<400> 3 

gaaaaggtgg acaagtccta ttttcaagag aagatgactt ttaacagttt tgaaggatct 60 
aaaacttgtg tacctgcaga catcaataag gaagaagaat ttgtagaaga gtttaataga 12 0 
ttaaaaactt ttgctaattt tccaagtggt agtcctgttt cagcatcaac actggcacga 180 
gcagggtttc tttatactgg tgaaggagat accgtgcggt gctttagttg tcatgcagct 240 
gtagatagat ggcaatatgg agactcagca gttggaagac acaggaaagt atccccaaat 300 
tgcagattta tcaacggctt ttatcttgaa aatagtgcca cgcagtctac aaattctggt 360 
atccagaatg gtcagtacaa agttgaaaac tatctgggaa gcagagatca ttttgcctta 42 0 
gacaggccat ctgagacaca tgcagactat cttttgagaa ctgggcaggt tgtagatata 480 
tcagacacca tatacccgag gaaccctgcc atgtattgtg aagaagctag attaaagtcc 54 0 
tttcagaact ggccagacta tgctcaccta accccaagag agttagcaag tgctggactc 600 
tactacacag gtattggtga ccaagtgcag tgcttttgtt gtggtggaaa actgaaaaat 660 
tgggaacctt gtgatcgtgc ctggtcagaa cacaggcgac actttcctaa ttgcttcttt 72 0 
gttttgggcc ggaatcttaa tattcgaagt gaatctgatg ctgtgagttc tgataggaat 78 0 
ttcccaaatt caacaaatct tccaagaaat ccatccatgg cagattatga agcacggatc 84 0 



tttacttttg ggacatggat atactcagtt aacaaggagc agcttgcaag agctggattt 900 
tatgctttag gtgaaggtga taaagtaaag tgctttcact gtggaggagg gctaactgat 960 
tggaagccca gtgaagaccc ttgggaacaa catgctaaat ggtatccagg gtgcaaatat 102 0 
ctgttagaac agaagggaca agaatatata aacaatattc atttaactca ttcacttgag 108 0 
gagtgtctgg taagaactac tgagaaaaca ccatcactaa ctagaagaat tgatgatacc 114 0 
atcttccaaa atcctatggt acaagaagct atacgaatgg ggttcagttt caaggacatt 1200 
aagaaaataa tggaggaaaa aattcagata tctgggagca actataaatc acttgaggtt 12 60 
ctggttgcag atctagtgaa tgctcagaaa gacagtatgc aagatgagtc aagtcagact 132 0 
tcattacaga aagagattag tactgaagag cagctaaggc gcctgcaaga ggagaagctt 13 80 
tgcaaaatct gtatggatag aaatattgct atcgtttttg ttccttgtgg acatctagtc 144 0 
acttgtaaac aatgtgctga agcagttgac aagtgtccca tgtgctacac agtcattact 1500 
ttcaagcaaa aaatttttat gtcttaatct aactctatag taggcatgtt atgttgttct 1560 
tattaccctg attgaatgtg tgatgtgaac tgactttaag taatcaggat tgaattccat 162 0 
tagcatttgc taccaagtag gaaaaaaaat gtacatggca gtgttttagt tggcaatata 1680 
atctttgaat ttcttgattt ttcagggtat tagctgtatt atccattttt tttactgtta 1740 
tttaattgaa accatagact aagaataaga agcatcatac tataactgaa cacaatgtgt 18 0 0 
attcatagta tactgattta atttctaagt gtaagtgaat taatcatctg gattttttat 1860 
tcttttcaga taggcttaac aaatggagct ttctgtatat aaatgtggag attagagtta 1920 
atctccccaa tcacataatt tgttttgtgt gaaaaaggaa taaattgttc catgctggtg 1980 
gaaagataga gattgttttt agaggttggt tgttgtgttt taggattctg tccattttct 2040
tgtaaaggga taaacacgga cgtgtgcgaa atatgtttgt aaagtgattt gccattgttg 2100 
aaagcgtatt taatgataga atactatcga gccaacatgt actgacatgg aaagatgtca 2160 
gagatatgtt aagtgtaaaa tgcaagtggc gggacactat gtatagtctg agccagatca 2220
aagtatgtat gttgttaata tgcatagaac gagagatttg gaaagatata caccaaactg 2280
ttaaatgtgg tttctcttcg gggagggggg gattggggga ggggccccag aggggtttta 2340
gaggggcctt ttcactttcg acttttttca ttttgttctg ttcggatttt ttataagtat 2400 
gtagaccccg aagggtttta tgggaactaa catcagtaac ctaacccccg tgactatcct 2460 
gtgctcttcc tagggagctg tgttgtttcc cacccaccac ccttccctct gaacaaatgc 2520 
ctgagtgctg gggcactttn 2540 

<210> 4 
<211> 497 
<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Thr Phe Asn Ser Phe Glu Gly Ser Lys Thr Cys Val Pro Ala Asp
1 5 10 15

Ile Asn Lys Glu Glu Glu Phe Val Glu Glu Phe Asn Arg Leu Lys Thr
20 25 30

Phe Ala Asn Phe Pro Ser Gly Ser Pro Val Ser Ala Ser Thr Leu Ala
35 40 45

Arg Ala Gly Phe Leu Tyr Thr Gly Glu Gly Asp Thr Val Arg Cys Phe
50 55 60

Ser Cys His Ala Ala Val Asp Arg Trp Gln Tyr Gly Asp Ser Ala Val
65 70 75 80

Gly Arg His Arg Lys Val Ser Pro Asn Cys Arg Phe Ile Asn Gly Phe
85 90 95

Tyr Leu Glu Asn Ser Ala Thr Gln Ser Thr Asn Ser Gly Ile Gln Asn
100 105 110

Gly Gln Tyr Lys Val Glu Asn Tyr Leu Gly Ser Arg Asp His Phe Ala
115 120 125

Leu Asp Arg Pro Ser Glu Thr His Ala Asp Tyr Leu Leu Arg Thr Gly
130 135 140

Gln Val Val Asp Ile Ser Asp Thr Ile Tyr Pro Arg Asn Pro Ala Met
145 150 155 160

Tyr Cys Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr
165 170 175

Ala His Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr
180 185 190

Gly Ile Gly Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys
195 200 205

Asn Trp Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe
210 215 220

Pro Asn Cys Phe Phe Val Leu Gly Arg Asn Leu Asn Ile Arg Ser Glu
225 230 235 240

Ser Asp Ala Val Ser Ser Asp Arg Asn Phe Pro Asn Ser Thr Asn Leu
245 250 255

Pro Arg Asn Pro Ser Met Ala Asp Tyr Glu Ala Arg Ile Phe Thr Phe
260 265 270

Gly Thr Trp Ile Tyr Ser Val Asn Lys Glu Gln Leu Ala Arg Ala Gly
275 280 285

Phe Tyr Ala Leu Gly Glu Gly Asp Lys Val Lys Cys Phe His Cys Gly
290 295 300

Gly Gly Leu Thr Asp Trp Lys Pro Ser Glu Asp Pro Trp Glu Gln His
305 310 315 320

Ala Lys Trp Tyr Pro Gly Cys Lys Tyr Leu Leu Glu Gln Lys Gly Gln
325 330 335

Glu Tyr Ile Asn Asn Ile His Leu Thr His Ser Leu Glu Glu Cys Leu
340 345 350

Val Arg Thr Thr Glu Lys Thr Pro Ser Leu Thr Arg Arg Ile Asp Asp
355 360 365

Thr Ile Phe Gln Asn Pro Met Val Gln Glu Ala Ile Arg Met Gly Phe
370 375 380

Ser Phe Lys Asp Ile Lys Lys Ile Met Glu Glu Lys Ile Gln Ile Ser
385 390 395 400

Gly Ser Asn Tyr Lys Ser Leu Glu Val Leu Val Ala Asp Leu Val Asn
405 410 415

Ala Gln Lys Asp Ser Met Gln Asp Glu Ser Ser Gln Thr Ser Leu Gln
420 425 430

Lys Glu Ile Ser Thr Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys
435 440 445

Leu Cys Lys Ile Cys Met Asp Arg Asn Ile Ala Ile Val Phe Val Pro
450 455 460

Cys Gly His Leu Val Thr Cys Lys Gln Cys Ala Glu Ala Val Asp Lys
465 470 475 480

Cys Pro Met Cys Tyr Thr Val Ile Thr Phe Lys Gln Lys Ile Phe Met
485 490 495

Ser



<210> 5 

<211> 2676 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> Variant 
<222> (1)...(2676)
<223> n = A, T, C, or G

<400> 5 

tccttgagat gtatcagtat aggatttagg atctccatgt tggaactcta aatgcataga 60
aatggaaata atggaaattt ttcattttgg cttttcagcc tagtattaaa actgataaaa 120
gcaaagccat gcacaaaact acctccctag agaaaggcta gtcccttttc ttccccattc 180 
atttcattat gaacatagta gaaaacagca tattcttatc aaatttgatg aaaagcgcca 240 
acacgtttga actgaaatac gacttgtcat gtgaactgta ccgaatgtct acgtattcca 300
cttttcctgc tggggttcct gtctcagaaa ggagtcttgc tcgtgctggt ttctattaca 360 
ctggtgtgaa tgacaaggtc aaatgcttct gttgtggcct gatgctggat aactggaaaa 420
gaggagacag tcctactgaa aagcataaaa agttgtatcc tagctgcaga ttcgttcaga 480 
gtctaaattc cgttaacaac ttggaagcta cctctcagcc tacttttcct tcttcagtaa 540 
cacattccac acactcatta cttccgggta cagaaaacag tggatatttc cgtggctctt 600 
attcaaactc tccatcaaat cctgtaaact ccagagcaaa tcaagaattt tctgccttga 660 
tgagaagttc ctacccctgt ccaatgaata acgaaaatgc cagattactt acttttcaga 720 
catggccatt gacttttctg tcgccaacag atctggcacg agcaggcttt tactacatag 780 
gacctggaga cagagtggct tgctttgcct gtggtggaaa attgagcaat tgggaaccga 840
aggataatgc tatgtcagaa cacctgagac attttcccaa atgcccattt atagaaaatc 900 
agcttcaaga cacttcaaga tacacagttt ctaatctgag catgcagaca catgcagccc 960 
gctttaaaac attctttaac tggccctcta gtgttctagt taatcctgag cagcttgcaa 1020
gtgcgggttt ttattatgtg ggtaacagtg atgatgtcaa atgcttttgc tgtgatggtg 1080 
gactcaggtg ttgggaatct ggagatgatc catgggttca acatgccaag tggtttccaa 1140 
ggtgtgagta cttgataaga attaaaggac aggagttcat ccgtcaagtt caagccagtt 1200
accctcatct acttgaacag ctgctatcca catcagacag cccaggagat gaaaatgcag 1260 
agtcatcaat tatccatttg gaacctggag aagaccattc agaagatgca atcatgatga 1320
atactcctgt gattaatgct gccgtggaaa tgggctttag tagaagcctg gtaaaacaga 1380 
cagttcagag aaaaatccta gcaactggag agaattatag actagtcaat gatcttgtgt 1440
tagacttact caatgcagaa gatgaaataa gggaagagga gagagaaaga gcaactgagg 1500
aaaaagaatc aaatgattta ttattaatcc ggaagaatag aatggcactt tttcaacatt 1560 
tgacttgtgt aattccaatc ctggatagtc tactaactgc cggaattatt aatgaacaag 1620
aacatgatgt tattaaacag aagacacaga cgtctttaca agcaagagaa ctgattgata 1680 
cgattttagt aaaaggaaat attgcagcca ctgtattcag aaactctctg caagaagctg 1740
aagctgtgtt atatgagcat ttatttgtgc aacaggacat aaaatatatt cccacagaag 1800 
atgtttcaga tctaccagtg gaagaacaat tgcggagact accagaagaa agaacatgta 1860 
aagtgtgtat ggacaaagaa gtgtccatag tgtttattcc ttgtggtcat ctagtagtat 1920
gcaaagattg tgctccttct ttaagaaagt gtcctatttg taggagtaca atcaagggta 1980 
cagttcgtac atttctttca tgaagaagaa ccaaaacatc gtctaaactt tagaattaat 2040 
ttattaaatg tattataact ttaactttta tcctaatttg gtttccttaa aatttttatt 2100 
tatttacaac tcaaaaaaca ttgttttgtg taacatattt atatatgtat ctaaaccata 2160 
tgaacatata ttttttagaa actaagagaa tgataggctt ttgttcttat gaacgaaaaa 2220 
gaggtagcac tacaaacaca atattcaatc caaatttcag cattattgaa attgtaagtg 2280
aagtaaaact taagatattt gagttaacct ttaagaattt taaatatttt ggcattgtac 2340 
taataccggg aacatgaagc caggtgtggt ggtatgtacc tgtagtccca ggctgaggca 2400
agagaattac ttgagcccag gagtttgaat ccatcctggg cagcatactg agaccctgcc 2460 
tttaaaaacn aacagnacca aanccaaaca ccagggacac atttctctgt cttttttgat 2520 
cagtgtccta tacatcgaag gtgtgcatat atgttgaatc acattttagg gacatggtgt 2580 
ttttataaag aattctgtga gnaaaaattt aataaagcaa ccaaattact cttaaaaaaa 2640 
aaaaaaaaaa aaaaaactcg aggggcccgt accaat 2676

<210> 6
<211> 604 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Asn Ile Val Glu Asn Ser Ile Phe Leu Ser Asn Leu Met Lys Ser
1 5 10 15

Ala Asn Thr Phe Glu Leu Lys Tyr Asp Leu Ser Cys Glu Leu Tyr Arg
20 25 30

Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu Arg
35 40 45

Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys Val
50 55 60

Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Arg Gly Asp
65 70 75 80

Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser Cys Arg Phe Val
85 90 95

Gln Ser Leu Asn Ser Val Asn Asn Leu Glu Ala Thr Ser Gln Pro Thr
100 105 110

Phe Pro Ser Ser Val Thr His Ser Thr His Ser Leu Leu Pro Gly Thr
115 120 125

Glu Asn Ser Gly Tyr Phe Arg Gly Ser Tyr Ser Asn Ser Pro Ser Asn
130 135 140

Pro Val Asn Ser Arg Ala Asn Gln Glu Phe Ser Ala Leu Met Arg Ser
145 150 155 160

Ser Tyr Pro Cys Pro Met Asn Asn Glu Asn Ala Arg Leu Leu Thr Phe
165 170 175

Gln Thr Trp Pro Leu Thr Phe Leu Ser Pro Thr Asp Leu Ala Arg Ala
180 185 190

Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys
195 200 205

Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asn Ala Met Ser Glu
210 215 220

His Leu Arg His Phe Pro Lys Cys Pro Phe Ile Glu Asn Gln Leu Gln
225 230 235 240

Asp Thr Ser Arg Tyr Thr Val Ser Asn Leu Ser Met Gln Thr His Ala
245 250 255

Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu Val Asn
260 265 270

Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn Ser Asp
275 280 285

Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser
290 295 300

Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg Cys Glu
305 310 315 320

Tyr Leu Ile Arg Ile Lys Gly Gln Glu Phe Ile Arg Gln Val Gln Ala
325 330 335

Ser Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Ser Pro
340 345 350

Gly Asp Glu Asn Ala Glu Ser Ser Ile Ile His Leu Glu Pro Gly Glu
355 360 365

Asp His Ser Glu Asp Ala Ile Met Met Asn Thr Pro Val Ile Asn Ala
370 375 380

Ala Val Glu Met Gly Phe Ser Arg Ser Leu Val Lys Gln Thr Val Gln
385 390 395 400

Arg Lys Ile Leu Ala Thr Gly Glu Asn Tyr Arg Leu Val Asn Asp Leu
405 410 415

Val Leu Asp Leu Leu Asn Ala Glu Asp Glu Ile Arg Glu Glu Glu Arg
420 425 430

Glu Arg Ala Thr Glu Glu Lys Glu Ser Asn Asp Leu Leu Leu Ile Arg
435 440 445

Lys Asn Arg Met Ala Leu Phe Gln His Leu Thr Cys Val Ile Pro Ile
450 455 460

Leu Asp Ser Leu Leu Thr Ala Gly Ile Ile Asn Glu Gln Glu His Asp
465 470 475 480

Val Ile Lys Gln Lys Thr Gln Thr Ser Leu Gln Ala Arg Glu Leu Ile
485 490 495

Asp Thr Ile Leu Val Lys Gly Asn Ile Ala Ala Thr Val Phe Arg Asn
500 505 510

Ser Leu Gln Glu Ala Glu Ala Val Leu Tyr Glu His Leu Phe Val Gln
515 520 525

Gln Asp Ile Lys Tyr Ile Pro Thr Glu Asp Val Ser Asp Leu Pro Val
530 535 540

Glu Glu Gln Leu Arg Arg Leu Pro Glu Glu Arg Thr Cys Lys Val Cys
545 550 555 560

Met Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val
565 570 575

Val Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg
580 585 590

Ser Thr Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
595 600



<210> 7 

<211> 2580 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Variant 
<222> (1)...(2580)
<223> n = A, T, C, or G

<400> 7 

ttaggttacc tgaaagagtt actacaaccc caaagagttg tgttctaagt agtatcttgg 60 
taattcagag agatactcat cctacctgaa tataaactga gataaatcca gtaaagaaag 120
tgtagtaaat tctacataag agtctatcat tgatttcttt ttgtggtgga aatcttagtt 180
catgtgaaga aatttcatgt gaatgtttta gctatcaaac agtactgtca cctactcatg 240 
cacaaaactg cctcccaaag acttttccca ggtccctcgt atcaaaacat taagagtata 300 
atggaagata gcacgatctt gtcagattgg acaaacagca acaaacaaaa aatgaagtat 360
gacttttcct gtgaactcta cagaatgtct acatattcaa ctttccccgc cggggtgcct 420
gtctcagaaa ggagtcttgc tcgtgctggt ttttattata ctggtgtgaa tgacaaggtc 480 
aaatgcttct gttgtggcct gatgctggat aactggaaac taggagacag tcctattcaa 540
aagcataaac agctatatcc tagctgtagc tttattcaga atctggtttc agctagtctg 600 
ggatccacct ctaagaatac gtctccaatg agaaacagtt ttgcacattc attatctccc 660
accttggaac atagtagctt gttcagtggt tcttactcca gccttcctcc aaaccctctt 720
aattctagag cagttgaaga catctcttca tcgaggacta acccctacag ttatgcaatg 780 
agtactgaag aagccagatt tcttacctac catatgtggc cattaacttt tttgtcacca 840 
tcagaattgg caagagctgg tttttattat ataggacctg gagatagggt agcctgcttt 900 
gcctgtggtg ggaagctcag taactgggaa ccaaaggatg atgctatgtc agaacaccgg 960 
aggcattttc ccaactgtcc atttttggaa aattctctag aaactctgag gtttagcatt 1020
tcaaatctga gcatgcagac acatgcagct cgaatgagaa catttatgta ctggccatct 1080 
agtgttccag ttcagcctga gcagcttgca agtgctggtt tttattatgt gggtcgcaat 1140
gatgatgtca aatgctttgg ttgtgatggt ggcttgaggt gttgggaatc tggagatgat 1200
ccatgggtag aacatgccaa gtggtttcca aggtgtgagt tcttgatacg aatgaaaggc 1260 
caagagtttg ttgatgagat tcaaggtaga tatcctcatc ttcttgaaca gctgttgtca 1320 
acttcagata ccactggaga agaaaatgct gacccaccaa ttattcattt tggacctgga 1380
gaaagttctt cagaagatgc tgtcatgatg aatacacctg tggttaaatc tgccttggaa 1440 
atgggcttta atagagacct ggtgaaacaa acagttctaa gtaaaatcct gacaactgga 1500 
gagaactata aaacagttaa tgatattgtg tcagcacttc ttaatgctga agatgaaaaa 1560 
agagaagagg agaaggaaaa acaagctgaa gaaatggcat cagatgattt gtcattaatt 1620
cggaagaaca gaatggctct ctttcaacaa ttgacatgtg tgcttcctat cctggataat 1680 
cttttaaagg ccaatgtaat taataaacag gaacatgata ttattaaaca aaaaacacag 1740
atacctttac aagcgagaga actgattgat accatttggg ttaaaggaaa tgctgcggcc 1800 
aacatcttca aaaactgtct aaaagaaatt gactctacat tgtataagaa cttatttgtg 1860 
gataagaata tgaagtatat tccaacagaa gatgtttcag gtctgtcact ggaagaacaa 1920
ttgaggaggt tgcaagaaga acgaacttgt aaagtgtgta tggacaaaga agtttctgtt 1980 
gtatttattc cttgtggtca tctggtagta tgccaggaat gtgccccttc tctaagaaaa 2040 
tgccctattt gcaggggtat aatcaagggt actgttcgta catttctctc ttaaagaaaa 2100 
atagtctata ttttaacctg cataaaaagg tctttaaaat attgttgaac acttgaagcc 2160 
atctaaagta aaaagggaat tatgagtttt tcaattagta acattcatgt tctagtctgc 2220
tttggtacta ataatcttgt ttctgaaaag atggtatcat atatttaatc ttaatctgtt 2280 
tatttacaag ggaagattta tgtttggtga actatattag tatgtatgtg tacctaaggg 2340 
agtagcgtcn ctgcttgtta tgcatcattt caggagttac tggatttgtt gttctttcag 2400 
aaagctttga anactaaatt atagtgtaga aaagaactgg aaaccaggaa ctctggagtt 2460 
catcagagtt atggtgccga attgtctttg gtgcttttca cttgtgtttt aaaataagga 2520 
tttttctctt atttctcccc ctagtttgtg agaaacatct caataaagtg ctttaaaaag 2580 



<210> 8 

<211> 618 

<212> PRT 

<213> Homo sapiens 



<400> 8 

Met His Lys Thr Ala Ser Gln Arg Leu Phe Pro Gly Pro Ser Tyr Gln
1 5 10 15

Asn Ile Lys Ser Ile Met Glu Asp Ser Thr Ile Leu Ser Asp Trp Thr
20 25 30

Asn Ser Asn Lys Gln Lys Met Lys Tyr Asp Phe Ser Cys Glu Leu Tyr
35 40 45

Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu
50 55 60

Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys
65 70 75 80

Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Leu Gly
85 90 95

Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser Cys Ser Phe
100 105 110

Ile Gln Asn Leu Val Ser Ala Ser Leu Gly Ser Thr Ser Lys Asn Thr
115 120 125

Ser Pro Met Arg Asn Ser Phe Ala His Ser Leu Ser Pro Thr Leu Glu
130 135 140

His Ser Ser Leu Phe Ser Gly Ser Tyr Ser Ser Leu Pro Pro Asn Pro
145 150 155 160

Leu Asn Ser Arg Ala Val Glu Asp Ile Ser Ser Ser Arg Thr Asn Pro
165 170 175

Tyr Ser Tyr Ala Met Ser Thr Glu Glu Ala Arg Phe Leu Thr Tyr His
180 185 190

Met Trp Pro Leu Thr Phe Leu Ser Pro Ser Glu Leu Ala Arg Ala Gly
195 200 205

Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys Gly
210 215 220

Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asp Ala Met Ser Glu His
225 230 235 240

Arg Arg His Phe Pro Asn Cys Pro Phe Leu Glu Asn Ser Leu Glu Thr
245 250 255

Leu Arg Phe Ser Ile Ser Asn Leu Ser Met Gln Thr His Ala Ala Arg
260 265 270

Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro Val Gln Pro Glu
275 280 285

Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg Asn Asp Asp Val
290 295 300

Lys Cys Phe Gly Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser Gly Asp
305 310 315 320

Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu Phe Leu
325 330 335

Ile Arg Met Lys Gly Gln Glu Phe Val Asp Glu Ile Gln Gly Arg Tyr
340 345 350

Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Thr Thr Gly Glu
355 360 365

Glu Asn Ala Asp Pro Pro Ile Ile His Phe Gly Pro Gly Glu Ser Ser
370 375 380

Ser Glu Asp Ala Val Met Met Asn Thr Pro Val Val Lys Ser Ala Leu
385 390 395 400

Glu Met Gly Phe Asn Arg Asp Leu Val Lys Gln Thr Val Leu Ser Lys
405 410 415

Ile Leu Thr Thr Gly Glu Asn Tyr Lys Thr Val Asn Asp Ile Val Ser
420 425 430

Ala Leu Leu Asn Ala Glu Asp Glu Lys Arg Glu Glu Glu Lys Glu Lys
435 440 445

Gln Ala Glu Glu Met Ala Ser Asp Asp Leu Ser Leu Ile Arg Lys Asn
450 455 460

Arg Met Ala Leu Phe Gln Gln Leu Thr Cys Val Leu Pro Ile Leu Asp
465 470 475 480

Asn Leu Leu Lys Ala Asn Val Ile Asn Lys Gln Glu His Asp Ile Ile
485 490 495

Lys Gln Lys Thr Gln Ile Pro Leu Gln Ala Arg Glu Leu Ile Asp Thr
500 505 510

Ile Trp Val Lys Gly Asn Ala Ala Ala Asn Ile Phe Lys Asn Cys Leu
515 520 525

Lys Glu Ile Asp Ser Thr Leu Tyr Lys Asn Leu Phe Val Asp Lys Asn
530 535 540

Met Lys Tyr Ile Pro Thr Glu Asp Val Ser Gly Leu Ser Leu Glu Glu
545 550 555 560

Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met Asp
565 570 575

Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val Cys
580 585 590

Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg Gly Ile
595 600 605

Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
610 615



<210> 9 

<211> 2100 

<212> DNA

<213> Mus musculus 

<400> 9 

gacactctgc tgggcggcgg gccgccctcc tccgggacct cccctcggga accgtcgccc 60 
gcggcgctta gttaggactg gagtgcttgg cgcgaaaagg tggacaagtc ctattttcca 120
gagaagatga cttttaacag ttttgaagga actagaactt ttgtacttgc agacaccaat 180
aaggatgaag aatttgtaga agagtttaat agattaaaaa catttgctaa cttcccaagt 240
agtagtcctg tttcagcatc aacattggcg cgagctgggt ttctttatac cggtgaagga 300
gacaccgtgc aatgtttcag ttgtcatgcg gcaatagata gatggcagta tggagactca 360
gctgttggaa gacacaggag aatatcccca aattgcagat ttatcaatgg tttttatttt 420
gaaaatggtg ctgcacagtc tacaaatcct ggtatccaaa atggccagta caaatctgaa 480 
aactgtgtgg gaaatagaaa tccttttgcc cctgacaggc cacctgagac tcatgctgat 540
tatctcttga gaactggaca ggttgtagat atttcagaca ccatataccc gaggaaccct 600 
gccatgtgta gtgaagaagc cagattgaag tcatttcaga actggccgga ctatgctcat 660 
ttaaccccca gagagttagc tagtgctggc ctctactaca caggggctga tgatcaagtg 720
caatgctttt gttgtggggg aaaactgaaa aattgggaac cctgtgatcg tgcctggtca 780 
gaacacagga gacactttcc caattgcttt tttgttttgg gccggaacgt taatgttcga 840
agtgaatctg gtgtgagttc tgataggaat ttcccaaatt caacaaactc tccaagaaat 900 
ccagccatgg cagaatatga agcacggatc gttacttttg gaacatggat atactcagtt 960 
aacaaggagc agcttgcaag agctggattt tatgctttag gtgaaggcga taaagtgaag 1020
tgcttccact gtggaggagg gctcacggat tggaagccaa gtgaagaccc ctgggaccag 1080
catgctaagt gctacccagg gtgcaaatac ctattggatg agaaggggca agaatatata 1140 
aataatattc atttaaccca tccacttgag gaatctttgg gaagaactgc tgaaaaaaca 1200 
ccaccgctaa ctaaaaaaat cgatgatacc atcttccaga atcctatggt gcaagaagct 1260 
atacgaatgg gatttagctt caaggacctt aagaaaacaa tggaagaaaa aatccaaaca 1320 
tccgggagca gctatctatc acttgaggtc ctgattgcag atcttgtgag tgctcagaaa 1380
gataatacgg aggatgagtc aagtcaaact tcattgcaga aagacattag tactgaagag 1440
cagctaaggc gcctacaaga ggagaagctt tccaaaatct gtatggatag aaatattgct 1500 
atcgtttttt ttccttgtgg acatctggcc acttgtaaac agtgtgcaga agcagttgac 1560 
aaatgtccca tgtgctacac cgtcattacg ttcaaccaaa aaatttttat gtcttagtgg 1620 
ggcaccacat gttatgttct tcttgctcta attgaatgtg taatgggagc gaactttaag 1680 
taatcctgca tttgcattcc attagcatcc tgctgtttcc aaatggagac caatgctaac 1740 
agcactgttt ccgtctaaac attcaatttc tggatctttc gagttatcag ctgtatcatt 1800 
tagccagtgt tttactcgat tgaaacctta gacagagaag cattttatag cttttcacat 1860 
gtatattggt agtacactga cttgatttct atatgtaagt gaattcatca cctgcatgtt 1920
tcatgccttt tgcataagct taacaaatgg agtgttctgt ataagcatgg agatgtgatg 1980 
gaatctgccc aatgacttta attggcttat tgtaaacacg gaaagaactg ccccacgctg 2040
ctgggaggat aaagattgtt ttagatgctc acttctgtgt tttaggattc tgcccattta 2100

<210> 10

<211> 496 

<212> PRT 

<213> Homo sapiens 



<400> 10 
Met Thr Phe Asn Ser Phe Glu Gly Thr Arg Thr Phe Val Leu Ala Asp
1 5 10 15

Thr Asn Lys Asp Glu Glu Phe Val Glu Glu Phe Asn Arg Leu Lys Thr
20 25 30

Phe Ala Asn Phe Pro Ser Ser Ser Pro Val Ser Ala Ser Thr Leu Ala
35 40 45

Arg Ala Gly Phe Leu Tyr Thr Gly Glu Gly Asp Thr Val Gln Cys Phe
50 55 60

Ser Cys His Ala Ala Ile Asp Arg Trp Gln Tyr Gly Asp Ser Ala Val
65 70 75 80

Gly Arg His Arg Arg Ile Ser Pro Asn Cys Arg Phe Ile Asn Gly Phe
85 90 95

Tyr Phe Glu Asn Gly Ala Ala Gln Ser Thr Asn Pro Gly Ile Gln Asn
100 105 110

Gly Gln Tyr Lys Ser Glu Asn Cys Val Gly Asn Arg Asn Pro Phe Ala
115 120 125

Pro Asp Arg Pro Pro Glu Thr His Ala Asp Tyr Leu Leu Arg Thr Gly
130 135 140

Gln Val Val Asp Ile Ser Asp Thr Ile Tyr Pro Arg Asn Pro Ala Met
145 150 155 160

Cys Ser Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr
165 170 175

Ala His Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr
180 185 190

Gly Ala Asp Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys
195 200 205

Asn Trp Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe
210 215 220

Pro Asn Cys Phe Phe Val Leu Gly Arg Asn Val Asn Val Arg Ser Glu
225 230 235 240

Ser Gly Val Ser Ser Asp Arg Asn Phe Pro Asn Ser Thr Asn Ser Pro
245 250 255

Arg Asn Pro Ala Met Ala Glu Tyr Glu Ala Arg Ile Val Thr Phe Gly
260 265 270

Thr Trp Ile Tyr Ser Val Asn Lys Glu Gln Leu Ala Arg Ala Gly Phe
275 280 285

Tyr Ala Leu Gly Glu Gly Asp Lys Val Lys Cys Phe His Cys Gly Gly
290 295 300

Gly Leu Thr Asp Trp Lys Pro Ser Glu Asp Pro Trp Asp Gln His Ala
305 310 315 320

Lys Cys Tyr Pro Gly Cys Lys Tyr Leu Leu Asp Glu Lys Gly Gln Glu
325 330 335

Tyr Ile Asn Asn Ile His Leu Thr His Pro Leu Glu Glu Ser Leu Gly
340 345 350

Arg Thr Ala Glu Lys Thr Pro Pro Leu Thr Lys Lys Ile Asp Asp Thr
355 360 365

Ile Phe Gln Asn Pro Met Val Gln Glu Ala Ile Arg Met Gly Phe Ser
370 375 380

Phe Lys Asp Leu Lys Lys Thr Met Glu Glu Lys Ile Gln Thr Ser Gly
385 390 395 400

Ser Ser Tyr Leu Ser Leu Glu Val Leu Ile Ala Asp Leu Val Ser Ala
405 410 415

Gln Lys Asp Asn Thr Glu Asp Glu Ser Ser Gln Thr Ser Leu Gln Lys
420 425 430

Asp Ile Ser Thr Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu
435 440 445

Ser Lys Ile Cys Met Asp Arg Asn Ile Ala Ile Val Phe Phe Pro Cys
450 455 460

Gly His Leu Ala Thr Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys
465 470 475 480

Pro Met Cys Tyr Thr Val Ile Thr Phe Asn Gln Lys Ile Phe Met Ser
485 490 495



<210> 11 
<211> 67 
<212> PRT 

<213> Orgyia pseudotsugata 
<400> 11 



Lys Ala Ala Arg Leu Gly Thr Tyr Thr Asn Trp Pro Val Gln Phe Leu
1 5 10 15

Glu Pro Ser Arg Met Ala Ala Ser Gly Phe Tyr Tyr Leu Gly Arg Gly
20 25 30

Asp Glu Val Arg Cys Ala Phe Cys Lys Val Glu Ile Thr Asn Trp Val
35 40 45

Arg Gly Asp Asp Pro Glu Thr Asp His Lys Arg Trp Ala Pro Gln Cys
50 55 60

Pro Phe Val
65



<210> 12 

<211> 275 

<212> PRT 

<213> Cydia pomonella



<400> 12 
Met Ser Asp Leu Arg Leu Glu Glu Val Arg Leu Asn Thr Phe Glu Lys
1 5 10 15

Trp Pro Val Ser Phe Leu Ser Pro Glu Thr Met Ala Lys Asn Gly Phe
20 25 30

Tyr Tyr Leu Gly Arg Ser Asp Glu Val Arg Cys Ala Phe Cys Lys Val
35 40 45

Glu Ile Met Arg Trp Lys Glu Gly Glu Asp Pro Ala Ala Asp His Lys
50 55 60

Lys Trp Ala Pro Gln Cys Pro Phe Val Lys Gly Ile Asp Val Cys Gly
65 70 75 80

Ser Ile Val Thr Thr Asn Asn Ile Gln Asn Thr Thr Thr His Asp Thr
85 90 95

Ile Ile Gly Pro Ala His Pro Lys Tyr Ala His Glu Ala Ala Arg Val
100 105 110

Lys Ser Phe His Asn Trp Pro Arg Cys Met Lys Gln Arg Pro Glu Gln
115 120 125

Met Ala Asp Ala Gly Phe Phe Tyr Thr Gly Tyr Gly Asp Asn Thr Lys
130 135 140

Cys Phe Tyr Cys Asp Gly Gly Leu Lys Asp Trp Glu Pro Glu Asp Val
145 150 155 160

Pro Trp Glu Gln His Val Arg Trp Phe Asp Arg Cys Ala Tyr Val Gln
165 170 175

Leu Val Lys Gly Arg Asp Tyr Val Gln Lys Val Ile Thr Glu Ala Cys
180 185 190

Val Leu Pro Gly Glu Asn Thr Thr Val Ser Thr Ala Ala Pro Val Ser
195 200 205

Glu Pro Ile Pro Glu Thr Lys Ile Glu Lys Glu Pro Gln Val Glu Asp
210 215 220

Ser Lys Leu Cys Lys Ile Cys Tyr Val Glu Glu Cys Ile Val Cys Phe
225 230 235 240

Val Pro Cys Gly His Val Val Ala Cys Ala Lys Cys Ala Leu Ser Val
245 250 255

Asp Lys Cys Pro Met Cys Arg Lys Ile Val Thr Ser Val Leu Lys Val
260 265 270

Tyr Phe Ser
275




























<210> 13
<211> 498
<212> PRT
<213> Drosophila melanogaster

<400> 13




























Met Thr Glu Leu Gly Met Glu Leu Glu Ser Val Arg Leu Ala Thr Phe
1 5 10 15

Gly Glu Trp Pro Leu Asn Ala Pro Val Ser Ala Glu Asp Leu Val Ala
20 25 30

Asn Gly Phe Phe Ala Thr Gly Lys Trp Leu Glu Ala Glu Cys His Phe
35 40 45

Cys His Val Arg Ile Asp Arg Trp Glu Tyr Gly Asp Gln Val Ala Glu
50 55 60

Arg His Arg Arg Ser Ser Pro Ile Cys Ser Met Val Leu Ala Pro Asn
65 70 75 80

His Cys Gly Asn Val Pro Arg Ser Gln Glu Ser Asp Asn Glu Gly Asn
85 90 95

Ser Val Val Asp Ser Pro Glu Ser Cys Ser Cys Pro Asp Leu Leu Leu
100 105 110

Glu Ala Asn Arg Leu Val Thr Phe Lys Asp Trp Pro Asn Pro Asn Ile
115 120 125

Thr Pro Gln Ala Leu Ala Lys Ala Gly Phe Tyr Tyr Leu Asn Arg Leu
130 135 140

Asp His Val Lys Cys Val Trp Cys Asn Gly Val Ile Ala Lys Trp Glu
145 150 155 160

Lys Asn Asp Asn Ala Phe Glu Glu His Lys Arg Phe Phe Pro Gln Cys
165 170 175

Pro Arg Val Gln Met Gly Pro Leu Ile Glu Phe Ala Thr Gly Lys Asn
180 185 190

Leu Asp Glu Leu Gly Ile Gln Pro Thr Thr Leu Pro Leu Arg Pro Lys
195 200 205

Tyr Ala Cys Val Asp Ala Arg Leu Arg Thr Phe Thr Asp Trp Pro Ile
210 215 220

Ser Asn Ile Gln Pro Ala Ser Ala Leu Ala Gln Ala Gly Leu Tyr Tyr
225 230 235 240

Gln Lys Ile Gly Asp Gln Val Arg Cys Phe His Cys Asn Ile Gly Leu
245 250 255

Arg Ser Trp Gln Lys Glu Asp Glu Pro Trp Phe Glu His Ala Lys Trp
260 265 270

Ser Pro Lys Cys Gln Phe Val Leu Leu Ala Lys Gly Pro Ala Tyr Val
275 280 285

Ser Glu Val Leu Ala Thr Thr Ala Ala Asn Ala Ser Ser Gln Pro Ala
290 295 300

Thr Ala Pro Ala Pro Thr Leu Gln Ala Asp Val Leu Met Asp Glu Ala
305 310 315 320

Pro Ala Lys Glu Ala Leu Thr Leu Gly Ile Asp Gly Gly Val Val Arg
325 330 335

Asn Ala Ile Gln Arg Lys Leu Leu Ser Ser Gly Cys Ala Phe Ser Thr
340 345 350

Leu Asp Glu Leu Leu His Asp Ile Phe Asp Asp Ala Gly Ala Gly Ala
355 360 365

Ala Leu Glu Val Arg Glu Pro Pro Glu Pro Ser Ala Pro Phe Ile Glu
370 375 380

Pro Cys Gln Ala Thr Thr Ser Lys Ala Ala Ser Val Pro Ile Pro Val
385 390 395 400

Ala Asp Ser Ile Pro Ala Lys Pro Gln Ala Ala Glu Ala Val Ser Asn
405 410 415

Ile Ser Lys Ile Thr Asp Glu Ile Gln Lys Met Ser Val Ser Thr Pro
420 425 430

Asn Gly Asn Leu Ser Leu Glu Glu Glu Asn Arg Gln Leu Lys Asp Ala
435 440 445

Arg Leu Cys Lys Val Cys Leu Asp Glu Glu Val Gly Val Val Phe Leu
450 455 460

Pro Cys Gly His Leu Ala Thr Cys Asn Gln Cys Ala Pro Ser Val Ala
465 470 475 480

Asn Cys Pro Met Cys Arg Ala Asp Ile Lys Gly Phe Val Arg Thr Phe
485 490 495

Leu Ser



<210> 14 
<211> 67 
<212> PRT 

<213> Cydia pomonella 
<400> 14 

Glu Glu Val Arg Leu Asn Thr Phe Glu Lys Trp Pro Val Ser Phe Leu
1 5 10 15

Ser Pro Glu Thr Met Ala Lys Asn Gly Phe Tyr Tyr Leu Gly Arg Ser
20 25 30

Asp Glu Val Arg Cys Ala Phe Cys Lys Val Glu Ile Met Arg Trp Lys
35 40 45

Glu Gly Glu Asp Pro Ala Ala Asp His Lys Lys Trp Ala Pro Gln Cys
50 55 60

Pro Phe Val
65



<210> 15 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<400> 15 

Glu Ala Asn Arg Leu Val Thr Phe Lys Asp Trp Pro Asn Pro Asn Ile
1 5 10 15

Thr Pro Gln Ala Leu Ala Lys Ala Gly Phe Tyr Tyr Leu Asn Arg Leu
20 25 30

Asp His Val Lys Cys Val Trp Cys Asn Gly Val Ile Ala Lys Trp Glu
35 40 45

Lys Asn Asp Asn Ala Phe Glu Glu His Lys Arg Phe Phe Pro Gln Cys
50 55 60

Pro Arg Val
65



<210> 16 

<211> 68 

<212> PRT 

<213> Mus musculus 



<400> 16 
Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Ser Ser Pro
1 5 10 15

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu
20 25 30

Gly Asp Thr Val Gln Cys Phe Ser Cys His Ala Ala Ile Asp Arg Trp
35 40 45

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Arg Ile Ser Pro Asn
50 55 60

Cys Arg Phe Ile
65



<210> 17 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Gly Ser Pro 

1 5 10 15 

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu 

20 25 30 

Gly Asp Thr Val Arg Cys Phe Ser Cys His Ala Ala Val Asp Arg Trp 
35 40 45 

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Lys Val Ser Pro Asn 

50 55 60 

Cys Arg Phe Ile 
65 



<210> 18 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 18 
Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro 
1 5 10 15 

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val 
20 25 30 

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp 
35 40 45 

Lys Arg Gly Asp Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser 
50 55 60 

Cys Arg Phe Val 
65 



<210> 19 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<400> 19 
Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro 
1 5 10 15 

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val 
20 25 30 

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp 
35 40 45 

Lys Leu Gly Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser 
50 55 60 

Cys Ser Phe Ile 
65 



<210> 20 

<211> 68 

<212> PRT 

<213> Mus musculus 



<400> 20 
Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His 
1 5 10 15 

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ala 
20 25 30 

Asp Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp 
35 40 45 

Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn 
50 55 60 

Cys Phe Phe Val 
65 



<210> 21 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 21 
Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His 
1 5 10 15 

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ile 
20 25 30 

Gly Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp 
35 40 45 

Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn 
50 55 60 

Cys Phe Phe Val 
65 



<210> 22 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<400> 22 
Glu Asn Ala Arg Leu Leu Thr Phe Gln Thr Trp Pro Leu Thr Phe Leu 
1 5 10 15 

Ser Pro Thr Asp Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly 
20 25 30 

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu 
35 40 45 

Pro Lys Asp Asn Ala Met Ser Glu His Leu Arg His Phe Pro Lys Cys 
50 55 60 

Pro Phe Ile 
65 



<210> 23 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Glu Glu Ala Arg Phe Leu Thr Tyr His Met Trp Pro Leu Thr Phe Leu 

1 5 10 15 

Ser Pro Ser Glu Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly 

20 25 30 

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu 
35 40 45 

Pro Lys Asp Asp Ala Met Ser Glu His Arg Arg His Phe Pro Asn Cys 

50 55 60 

Pro Phe Leu 
65 



<210> 24 

<211> 66 

<212> PRT 

<213> Mus musculus 



<400> 24 
Tyr Glu Ala Arg Ile Val Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn 
1 5 10 15 

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp 
20 25 30 

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro 
35 40 45 

Ser Glu Asp Pro Trp Asp Gln His Ala Lys Cys Tyr Pro Gly Cys Lys 
50 55 60 

Tyr Leu 
65 



<210> 25 

<211> 66 

<212> PRT 

<213> Homo sapiens 



<400> 25 
Tyr Glu Ala Arg Ile Phe Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn 
1 5 10 15 

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp 
20 25 30 

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro 
35 40 45 

Ser Glu Asp Pro Trp Glu Gln His Ala Lys Trp Tyr Pro Gly Cys Lys 
50 55 60 

Tyr Leu 
65 



<210> 26 

<211> 68 

<212> PRT 

<213> Homo sapiens 



<400> 26 

His Ala Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu 
1 5 10 15 

Val Asn Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn 
20 25 30 

Ser Asp Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp 
35 40 45 

Glu Ser Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg 
50 55 60 

Cys Glu Tyr Leu 
65 



<210> 27 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 27 
His Ala Ala Arg Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro 
1 5 10 15 

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg 
20 25 30 

Asn Asp Asp Val Lys Cys Phe Gly Cys Asp Gly Gly Leu Arg Cys Trp 
35 40 45 

Glu Ser Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg 
50 55 60 

Cys Glu Phe Leu 
65 



<210> 28 

<211> 68 

<212> PRT 

<213> Orgyia pseudotsugata 



<400> 28 

Glu Ala Ala Arg Leu Arg Thr Phe Ala Glu Trp Pro Arg Gly Leu Lys 
1 5 10 15 

Gln Arg Pro Glu Glu Leu Ala Glu Ala Gly Phe Phe Tyr Thr Gly Gln 
20 25 30 

Gly Asp Lys Thr Arg Cys Phe Cys Cys Asp Gly Gly Leu Lys Asp Trp 
35 40 45 

Glu Pro Asp Asp Ala Pro Trp Gln Gln His Ala Arg Trp Tyr Asp Arg 
50 55 60 

Cys Glu Tyr Val 
65 



<210> 29 

<211> 68 

<212> PRT 

<213> Cydia pomonella 



<400> 29 

Glu Ala Ala Arg Val Lys Ser Phe His Asn Trp Pro Arg Cys Met Lys 
1 5 10 15 

Gln Arg Pro Glu Gln Met Ala Asp Ala Gly Phe Phe Tyr Thr Gly Tyr 
20 25 30 

Gly Asp Asn Thr Lys Cys Phe Tyr Cys Asp Gly Gly Leu Lys Asp Trp 
35 40 45 

Glu Pro Glu Asp Val Pro Trp Glu Gln His Val Arg Trp Phe Asp Arg 
50 55 60 

Cys Ala Tyr Val 
65 



<210> 30 
<211> 68 
<212> PRT 

<213> Drosophila melanogaster 
<400> 30 

Val Asp Ala Arg Leu Arg Thr Phe Thr Asp Trp Pro Ile Ser Asn Ile 

1 5 10 15 

Gln Pro Ala Ser Ala Leu Ala Gln Ala Gly Leu Tyr Tyr Gln Lys Ile 

20 25 30 

Gly Asp Gln Val Arg Cys Phe His Cys Asn Ile Gly Leu Arg Ser Trp 

35 40 45 

Gln Lys Glu Asp Glu Pro Trp Phe Glu His Ala Lys Trp Ser Pro Lys 

50 55 60 

Cys Gln Phe Val 
65 



<210> 31 
<211> 66 
<212> PRT 

<213> Drosophila melanogaster 
<400> 31 

Glu Ser Val Arg Leu Ala Thr Phe Gly Glu Trp Pro Leu Asn Ala Pro 

1 5 10 15 

Val Ser Ala Glu Asp Leu Val Ala Asn Gly Phe Phe Gly Thr Trp Met 

20 25 30 

Glu Ala Glu Cys Asp Phe Cys His Val Arg Ile Asp Arg Trp Glu Tyr 

35 40 45 

Gly Asp Leu Val Ala Glu Arg His Arg Arg Ser Ser Pro Ile Cys Ser 

50 55 60 

Met Val 
65 



<210> 32 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 32 

Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met 

1 5 10 15 

Asp Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val 

20 25 30 

Cys Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys 
35 40 45 



<210> 33 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 33 

Glu Gln Leu Arg Arg Leu Pro Glu Glu Arg Thr Cys Lys Val Cys Met 

1 5 10 15 

Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val 

20 25 30 

Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys 
35 40 45 



<210> 34 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Ser Lys Ile Cys Met 

1 5 10 15 

Asp Arg Asn Ile Ala Ile Val Phe Phe Pro Cys Gly His Leu Ala Thr 

20 25 30 

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys 
35 40 45 



<210> 35 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 35 

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Cys Lys Ile Cys Met 

1 5 10 15 

Asp Arg Asn Ile Ala Ile Val Phe Val Pro Cys Gly His Leu Val Thr 

20 25 30 

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys 
35 40 45 



<210> 36 
<211> 46 
<212> PRT 

<213> Drosophila melanogaster 
<400> 36 

Glu Glu Asn Arg Gln Leu Lys Asp Ala Arg Leu Cys Lys Val Cys Leu 

1 5 10 15 

Asp Glu Glu Val Gly Val Val Phe Leu Pro Cys Gly His Leu Ala Thr 

20 25 30 

Cys Asn Gln Cys Ala Pro Ser Val Ala Asn Cys Pro Met Cys 
35 40 45 



<210> 37 
<211> 46 
<212> PRT 

<213> Cydia pomonella 
<400> 37 

Glu Lys Glu Pro Gln Val Glu Asp Ser Lys Leu Cys Lys Ile Cys Tyr 

1 5 10 15 

Val Glu Glu Cys Ile Val Cys Phe Val Pro Cys Gly His Val Val Ala 

20 25 30 

Cys Ala Lys Cys Ala Leu Ser Val Asp Lys Cys Pro Met Cys 
35 40 45 



<210> 38 
<211> 46 
<212> PRT 

<213> Orgyia pseudotsugata 
<400> 38 

Ala Val Glu Ala Glu Val Ala Asp Asp Arg Leu Cys Lys Ile Cys Leu 

1 5 10 15 

Gly Ala Glu Lys Thr Val Cys Phe Val Pro Cys Gly His Val Val Ala 

20 25 30 

Cys Gly Lys Cys Ala Ala Gly Val Thr Thr Cys Pro Val Cys 
35 40 45 



<210> 39 

<211> 2474 

<212> DNA 

<213> Mus musculus 

<400> 39 

gaattccggg agacctacac ccccggagat cagaggtcat tgctggcgtt cagagcctag 60 
gaagtgggct gcggtatcag cctagcagta aaaccgacca gaagccatgc acaaaactac 120 
atccccagag aaagacttgt cccttcccct ccctgtcatc tcaccatgaa catggttcaa 180 
gacagcgcct ttctagccaa gctgatgaag agtgctgaca cctttgagtt gaagtatgac 240 
ttttcctgtg agctgtaccg attgtccacg tattcagctt ttcccagggg agttcctgtg 300 
tcagaaagga gtctggctcg tgctggcttt tactacactg gtgccaatga caaggtcaag 360 
tgcttctgct gtggcctgat gctagacaac tggaaacaag gggacagtcc catggagaag 420 
cacagaaagt tgtaccccag ctgcaacttt gtacagactt tgaatccagc caacagtctg 480 
gaagctagtc ctcggccttc tcttccttcc acggcgatga gcaccatgcc tttgagcttt 540 
gcaagttctg agaatactgg ctatttcagt ggctcttact cgagctttcc ctcagaccct 600 
gtgaacttcc gagcaaatca agattgtcct gctttgagca caagtcccta ccactttgca 660 
atgaacacag agaaggccag attactcacc tatgaaacat ggccattgtc ttttctgtca 720 
ccagcaaagc tggccaaagc aggcttctac tacataggac ctggagatag agtggcctgc 780 
tttgcgtgcg atgggaaact gagcaactgg gaacgtaagg atgatgctat gtcagagcac 840 
cagaggcatt tccccagctg tccgttctta aaagacttgg gtcagtctgc ttcgagatac 900 
actgtctcta acctgagcat gcagacacac gcagcccgta ttagaacatt ctctaactgg 960 
ccttctagtg cactagttca ttcccaggaa cttgcaagtg cgggctttta ttatacagga 1020 
cacagtgatg atgtcaagtg tttatgctgt gatggtgggc tgaggtgctg ggaatctgga 1080 
gatgacccct gggtggaaca tgccaagtgg tttccaaggt gtgagtactt gctcagaatc 1140 
aaaggccaag aatttgtcag ccaagttcaa gctggctatc ctcatctact tgagcagcta 1200 
ttatctacgt cagactcccc agaagatgag aatgcagacg cagcaatcgt gcattttggc 1260 
cctggagaaa gttcggaaga tgtcgtcatg atgagcacgc ctgtggttaa agcagccttg 1320 
gaaatgggct tcagtaggag cctggtgaga cagacggttc agtggcagat cctggccact 1380 
ggtgagaact acaggaccgt cagtgacctc gttataggct tactcgatgc agaagacgag 1440 
atgagagagg agcagatgga gcaggcggcc gaggaggagg agtcagatga tctagcacta 1500 
atccggaaga acaaaatggt gcttttccaa catttgacgt gtgtgacacc aatgctgtat 1560 
tgcctcctaa gtgcaagggc catcactgaa caggagtgca atgctgtgaa acagaaacca 1620 
cacaccttac aagcaagcac actgattgat actgtgttag caaaaggaaa cactgcagca 1680 
acctcattca gaaactccct tcgggaaatt gaccctgcgt tatacagaga tatatttgtg 1740 
caacaggaca ttaggagtct tcccacagat gacattgcag ctctaccaat ggaagaacag 1800 
ttgcggcccc tcccggagga cagaatgtgt aaagtgtgta tggaccgaga ggtatccatc 1860 
gtgttcattc cctgtggcca tctggtcgtg tgcaaagact gcgctccctc tctgaggaag 1920 
tgtcccatct gtagagggac catcaagggc acagtgcgca catttctctc ctgaacaaga 1980 
ctaatggtcc atggctgcaa cttcagccag gaggaagttc actgtcactc ccagttccat 2040 
tcggaacttg aggccagcct ggatagcacg agacaccgcc aaacacacaa atataaacat 2100 
gaaaaacttt tgtctgaagt caagaatgaa tgaattactt atataataat tttaattggt 2160 
ttccttaaaa gtgctatttg ttcccaactc agaaaattgt tttctgtaaa catatttaca 2220 
tactacctgc atctaaagta ttcatatatt catatattca gatgtcatga gagagggttt 2280 
tgttcttgtt cctgaaaagc tggtttatca tctgatcagc atatactgcg caacgggcag 2340 
ggctagaatc catgaaccaa gctgcaaaga tctcacgcta aataaggcgg aaagatttgg 2400 
agaaacgaaa ggaaattctt tcctgtccaa tgtatactct tcagactaat gacctcttcc 2460 
tatcaagcct tcta 2474 

<210> 40 
<211> 602 
<212> PRT 

<213> Mus musculus 
<400> 40 

Met Asn Met Val Gln Asp Ser Ala Phe Leu Ala Lys Leu Met Lys Ser 
1 5 10 15 

Ala Asp Thr Phe Glu Leu Lys Tyr Asp Phe Ser Cys Glu Leu Tyr Arg 
20 25 30 

Leu Ser Thr Tyr Ser Ala Phe Pro Arg Gly Val Pro Val Ser Glu Arg 
35 40 45 

Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Ala Asn Asp Lys Val 
50 55 60 

Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Gln Gly Asp 
65 70 75 80 

Ser Pro Met Glu Lys His Arg Lys Leu Tyr Pro Ser Cys Asn Phe Val 
85 90 95 

Gln Thr Leu Asn Pro Ala Asn Ser Leu Glu Ala Ser Pro Arg Pro Ser 
100 105 110 

Leu Pro Ser Thr Ala Met Ser Thr Met Pro Leu Ser Phe Ala Ser Ser 
115 120 125 

Glu Asn Thr Gly Tyr Phe Ser Gly Ser Tyr Ser Ser Phe Pro Ser Asp 
130 135 140 

Pro Val Asn Phe Arg Ala Asn Gln Asp Cys Pro Ala Leu Ser Thr Ser 
145 150 155 160 

Pro Tyr His Phe Ala Met Asn Thr Glu Lys Ala Arg Leu Leu Thr Tyr 
165 170 175 

Glu Thr Trp Pro Leu Ser Phe Leu Ser Pro Ala Lys Leu Ala Lys Ala 
180 185 190 

Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys 
195 200 205 

Asp Gly Lys Leu Ser Asn Trp Glu Arg Lys Asp Asp Ala Met Ser Glu 
210 215 220 

His Gln Arg His Phe Pro Ser Cys Pro Phe Leu Lys Asp Leu Gly Gln 
225 230 235 240 

Ser Ala Ser Arg Tyr Thr Val Ser Asn Leu Ser Met Gln Thr His Ala 
245 250 255 

Ala Arg Ile Arg Thr Phe Ser Asn Trp Pro Ser Ser Ala Leu Val His 
260 265 270 

Ser Gln Glu Leu Ala Ser Ala Gly Phe Tyr Tyr Thr Gly His Ser Asp 
275 280 285 

Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser 
290 295 300 

Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu 
305 310 315 320 

Tyr Leu Leu Arg Ile Lys Gly Gln Glu Phe Val Ser Gln Val Gln Ala 
325 330 335 

Gly Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Ser Pro 
340 345 350 

Glu Asp Glu Asn Ala Asp Ala Ala Ile Val His Phe Gly Pro Gly Glu 
355 360 365 

Ser Ser Glu Asp Val Val Met Met Ser Thr Pro Val Val Lys Ala Ala 
370 375 380 

Leu Glu Met Gly Phe Ser Arg Ser Leu Val Arg Gln Thr Val Gln Trp 
385 390 395 400 

Gln Ile Leu Ala Thr Gly Glu Asn Tyr Arg Thr Val Ser Asp Leu Val 
405 410 415 

Ile Gly Leu Leu Asp Ala Glu Asp Glu Met Arg Glu Glu Gln Met Glu 
420 425 430 

Gln Ala Ala Glu Glu Glu Glu Ser Asp Asp Leu Ala Leu Ile Arg Lys 
435 440 445 

Asn Lys Met Val Leu Phe Gln His Leu Thr Cys Val Thr Pro Met Leu 
450 455 460 

Tyr Cys Leu Leu Ser Ala Arg Ala Ile Thr Glu Gln Glu Cys Asn Ala 
465 470 475 480 

Val Lys Gln Lys Pro His Thr Leu Gln Ala Ser Thr Leu Ile Asp Thr 
485 490 495 

Val Leu Ala Lys Gly Asn Thr Ala Ala Thr Ser Phe Arg Asn Ser Leu 
500 505 510 

Arg Glu Ile Asp Pro Ala Leu Tyr Arg Asp Ile Phe Val Gln Gln Asp 
515 520 525 

Ile Arg Ser Leu Pro Thr Asp Asp Ile Ala Ala Leu Pro Met Glu Glu 
530 535 540 

Gln Leu Arg Pro Leu Pro Glu Asp Arg Met Cys Lys Val Cys Met Asp 
545 550 555 560 

Arg Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val Cys 
565 570 575 

Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg Gly Thr 
580 585 590 

Ile Lys Gly Thr Val Arg Thr Phe Leu Ser 
595 600 



<210> 41 

<211> 2416 

<212> DNA 

<213> Mus musculus 



<400> 41 

ctgtggtgga gatctattgt ccaagtggtg agaaacttca tctggaagtt taagcggtca 60 
gaaatactat tactactcat ggacaaaact gtctcccaga gactcgccca aggtacctta 120 
cacccaaaaa cttaaacgta taatggagaa gagcacaatc ttgtcaaatt ggacaaagga 180 
gagcgaagaa aaaatgaagt ttgacttttc gtgtgaactc taccgaatgt ctacatattc 240 
agcttttccc aggggagttc ctgtctcaga gaggagtctg gctcgtgctg gcttttatta 300 
tacaggtgtg aatgacaaag tcaagtgctt ctgctgtggc ctgatgttgg ataactggaa 360 
acaaggggac agtcctgttg aaaagcacag acagttctat cccagctgca gctttgtaca 420 
gactctgctt tcagccagtc tgcagtctcc atctaagaat atgtctcctg tgaaaagtag 480 
atttgcacat tcgtcacctc tggaacgagg tggcattcac tccaacctgt gctctagccc 540 
tcttaattct agagcagtgg aagacttctc atcaaggatg gatccctgca gctatgccat 600 
gagtacagaa gaggccagat ttcttactta cagtatgtgg cctttaagtt ttctgtcacc 660 
agcagagctg gccagagctg gcttctatta catagggcct ggagacaggg tggcctgttt 720 
tgcctgtggt gggaaactga gcaactggga accaaaggat tatgctatgt cagagcaccg 780 
cagacatttt ccccactgtc catttctgga aaatacttca gaaacacaga ggtttagtat 840 
atcaaatcta agtatgcaga cacactctgc tcgattgagg acatttctgt actggccacc 900 
tagtgttcct gttcagcccg agcagcttgc aagtgctgga ttctattacg tggatcgcaa 960 
tgatgatgtc aagtgccttt gttgtgatgg tggcttgaga tgttgggaac ctggagatga 1020 
cccctggata gaacacgcca aatggtttcc aaggtgtgag ttcttgatac ggatgaaggg 1080 
tcaggagttt gttgatgaga ttcaagctag atatcctcat cttcttgagc agctgttgtc 1140 
cacttcagac accccaggag aagaaaatgc tgaccctaca gagacagtgg tgcattttgg 1200 
ccctggagaa agttcgaaag atgtcgtcat gatgagcacg cctgtggtta aagcagcctt 1260 
ggaaatgggc ttcagtagga gcctggtgag acagacggtt cagcggcaga tcctggccac 1320 
tggtgagaac tacaggaccg tcaatgatat tgtctcagta cttttgaatg ctgaagatga 1380 
gagaagagaa gaggagaagg aaagacagac tgaagagatg gcatcaggtg acttatcact 1440 
gattcggaag aatagaatgg ccctctttca acagttgaca catgtccttc ctatcctgga 1500 
taatcttctt gaggccagtg taattacaaa acaggaacat gatattatta gacagaaaac 1560 
acagataccc ttacaagcaa gagagcttat tgacaccgtt ttagtcaagg gaaatgctgc 1620 
agccaacatc ttcaaaaact ctctgaaggg aattgactcc acgttatatg aaaacttatt 1680 
tgtggaaaag aatatgaagt atattccaac agaagacgtt tcaggcttgt cattggaaga 1740 
gcagttgcgg agattacaag aagaacgaac ttgcaaagtg tgtatggaca gagaggtttc 1800 
tattgtgttc attccgtgtg gtcatctagt agtctgccag gaatgtgccc cttctctaag 1860 
gaagtgcccc atctgcaggg ggacaatcaa ggggactgtg cgcacatttc tctcatgagt 1920 
gaagaatggt ctgaaagtat tgttggacat cagaagctgt cagaacaaag aatgaactac 1980 
tgatttcagc tcttcagcag gacattctac tctctttcaa gattagtaat cttgctttat 2040 
gaagggtagc attgtatatt taagcttagt ctgttgcaag ggaaggtcta tgctgttgag 2100 
ctacaggact gtgtctgttc cagagcagga gttgggatgc ttgctgtatg tccttcagga 2160 
cttcttggga tttgggaatt tggggaaagc tttggaatcc agtgatgtgg agctcagaaa 2220 
tcctggaacc agtgactctg gtactcagta gatagggtac cctgtacttc ttggtgcttt 2280 
tccagtctgg gaaataagga ggaatctgct gctggtaaaa atttgctgga tgtgagaaat 2340 
agatgaaagt gtttcgggtg ggggcgtgca tcagtgtagt gtgtgcaggg atgtatgcag 2400 
gccaaacact gtgtag 2416 

<210> 42 
<211> 591 
<212> PRT 

<213> Mus musculus 



<400> 42 
Met Glu Lys Ser Thr Ile Leu Ser Asn Trp Thr Lys Glu Ser Glu Glu 
1 5 10 15 

Lys Met Lys Phe Asp Phe Ser Cys Glu Leu Tyr Arg Met Ser Thr Tyr 
20 25 30 

Ser Ala Phe Pro Arg Gly Val Pro Val Ser Glu Arg Ser Leu Ala Arg 
35 40 45 

Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys Val Lys Cys Phe Cys 
50 55 60 

Cys Gly Leu Met Leu Asp Asn Trp Lys Gln Gly Asp Ser Pro Val Glu 
65 70 75 80 

Lys His Arg Gln Phe Tyr Pro Ser Cys Ser Phe Val Gln Thr Leu Leu 
85 90 95 

Ser Ala Ser Leu Gln Ser Pro Ser Lys Asn Met Ser Pro Val Lys Ser 
100 105 110 

Arg Phe Ala His Ser Ser Pro Leu Glu Arg Gly Gly Ile His Ser Asn 
115 120 125 

Leu Cys Ser Ser Pro Leu Asn Ser Arg Ala Val Glu Asp Phe Ser Ser 
130 135 140 

Arg Met Asp Pro Cys Ser Tyr Ala Met Ser Thr Glu Glu Ala Arg Phe 
145 150 155 160 

Leu Thr Tyr Ser Met Trp Pro Leu Ser Phe Leu Ser Pro Ala Glu Leu 
165 170 175 

Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys 
180 185 190 

Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Tyr Ala 
195 200 205 

Met Ser Glu His Arg Arg His Phe Pro His Cys Pro Phe Leu Glu Asn 
210 215 220 

Thr Ser Glu Thr Gln Arg Phe Ser Ile Ser Asn Leu Ser Met Gln Thr 
225 230 235 240 

His Ser Ala Arg Leu Arg Thr Phe Leu Tyr Trp Pro Pro Ser Val Pro 
245 250 255 

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Asp Arg 
260 265 270 

Asn Asp Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp 
275 280 285 

Glu Pro Gly Asp Asp Pro Trp Ile Glu His Ala Lys Trp Phe Pro Arg 
290 295 300 

Cys Glu Phe Leu Ile Arg Met Lys Gly Gln Glu Phe Val Asp Glu Ile 
305 310 315 320 

Gln Ala Arg Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp 
325 330 335 

Thr Pro Gly Glu Glu Asn Ala Asp Pro Thr Glu Thr Val Val His Phe 
340 345 350 

Gly Pro Gly Glu Ser Ser Lys Asp Val Val Met Met Ser Thr Pro Val 
355 360 365 

Val Lys Ala Ala Leu Glu Met Gly Phe Ser Arg Ser Leu Val Arg Gln 
370 375 380 

Thr Val Gln Arg Gln Ile Leu Ala Thr Gly Glu Asn Tyr Arg Thr Val 
385 390 395 400 

Asn Asp Ile Val Ser Val Leu Leu Asn Ala Glu Asp Glu Arg Arg Glu 
405 410 415 

Glu Glu Lys Glu Arg Gln Thr Glu Glu Met Ala Ser Gly Asp Leu Ser 
420 425 430 

Leu Ile Arg Lys Asn Arg Met Ala Leu Phe Gln Gln Leu Thr His Val 
435 440 445 

Leu Pro Ile Leu Asp Asn Leu Leu Glu Ala Ser Val Ile Thr Lys Gln 
450 455 460 

Glu His Asp Ile Ile Arg Gln Lys Thr Gln Ile Pro Leu Gln Ala Arg 
465 470 475 480 

Glu Leu Ile Asp Thr Val Leu Val Lys Gly Asn Ala Ala Ala Asn Ile 
485 490 495 

Phe Lys Asn Ser Leu Lys Gly Ile Asp Ser Thr Leu Tyr Glu Asn Leu 
500 505 510 

Phe Val Glu Lys Asn Met Lys Tyr Ile Pro Thr Glu Asp Val Ser Gly 
515 520 525 

Leu Ser Leu Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys 
530 535 540 

Lys Val Cys Met Asp Arg Glu Val Ser Ile Val Phe Ile Pro Cys Gly 
545 550 555 560 

His Leu Val Val Cys Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro 
565 570 575 

Ile Cys Arg Gly Thr Ile Lys Gly Thr Val Arg Thr Phe Leu Ser 
580 585 590 



<210> 43 
<211> 11 
<212> PRT 

<213> artificial sequence based on Homo sapiens 
<400> 43 

Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu 
1 5 10 



<210> 44 
<211> 635 
<212> PRT 

<213> artificial sequence based on Homo sapiens, Mus musculus, Cydia 
pomonella, and Drosophila melanogaster 
<220> 

<221> VARIANT 
<222> 1,2,3,635 

<223> any amino acid or may be absent 



<221> VARIANT 

<222> (1)...(635) 

<223> Xaa = Any Amino Acid 



<400> 44 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Glu Xaa Xaa Arg 
20 25 30 

Leu Xaa Thr Phe Xaa Xaa Phe Pro Xaa Xaa Xaa Pro Val Ser Xaa Xaa 
35 40 45 

Xaa Leu Ala Arg Ala Gly Phe Xaa Tyr Thr Gly Xaa Xaa Asp Xaa Val 
50 55 60 

Xaa Cys Phe Xaa Cys Xaa Xaa Xaa Xaa Asp Xaa Trp Xaa Xaa Gly Asp 
65 70 75 80 

Ser Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Pro Xaa Cys Xaa Phe Ile 
85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 

Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa Ser Asp Xaa Xaa Xaa Xaa Xaa Xaa 
165 170 175 

Xaa Xaa Xaa Met Xaa Xaa Glu Glu Ala Arg Leu Xaa Thr Phe Xaa Xaa 
180 185 190 

Trp Pro Xaa Xaa Xaa Xaa Leu Xaa Pro Xaa Glu Leu Ala Xaa Ala Gly 
195 200 205 

Phe Tyr Tyr Xaa Gly Xaa Xaa Asp Xaa Val Xaa Cys Phe Xaa Cys Gly 
210 215 220 

Gly Lys Leu Xaa Asn Trp Glu Pro Xaa Asp Xaa Ala Xaa Ser Glu His 
225 230 235 240 

Xaa Arg His Phe Pro Xaa Cys Pro Phe Val Xaa Xaa Xaa Xaa Xaa Xaa 
245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa 
260 265 270 

Ser Xaa Xaa Xaa Pro Xaa Asn Pro Xaa Met Ala Xaa Xaa Xaa Ala Arg 
275 280 285 

Xaa Xaa Thr Phe Xaa Xaa Trp Pro Xaa Ser Xaa Xaa Val Xaa Xaa Glu 
290 295 300 

Gln Leu Ala Xaa Ala Gly Phe Tyr Tyr Xaa Gly Xaa Gly Asp Xaa Val 
305 310 315 320 

Lys Cys Phe Xaa Cys Xaa Gly Gly Leu Xaa Xaa Trp Xaa Xaa Xaa Asp 
325 330 335 

Asp Pro Trp Xaa Gln His Ala Lys Trp Phe Pro Xaa Cys Xaa Tyr Leu 
340 345 350 

Xaa Xaa Xaa Lys Gly Gln Glu Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
355 360 365 

Xaa Xaa Leu Xaa Glu Xaa Leu Xaa Xaa Thr Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa 
385 390 395 400 

Xaa Xaa Asp Xaa Val Xaa Xaa Xaa Xaa Pro Xaa Val Xaa Xaa Ala Xaa 
405 410 415 

Xaa Met Gly Phe Xaa Xaa Xaa Xaa Val Lys Xaa Xaa Xaa Xaa Xaa Lys 
420 425 430 

Ile Xaa Xaa Xaa Gly Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Leu Val Xaa 
435 440 445 

Asp Leu Xaa Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa 
450 455 460 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
465 470 475 480 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
485 490 495 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
500 505 510 

Xaa Xaa Xaa Xaa Gln Xaa Xaa Leu Gln Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
515 520 525 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
530 535 540 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
545 550 555 560 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Glu Glu 
565 570 575 

Gln Leu Arg Arg Leu Xaa Glu Glu Xaa Leu Cys Lys Xaa Cys Met Asp 
580 585 590 

Xaa Glu Val Xaa Xaa Val Phe Xaa Pro Cys Gly His Leu Val Xaa Cys 
595 600 605 

Xaa Xaa Cys Ala Xaa Ser Val Xaa Lys Cys Pro Met Cys Arg Xaa Xaa 
610 615 620 

Ile Xaa Xaa Xaa Xaa Xaa Xaa Phe Leu Ser Xaa 
625 630 635 



<210> 45 

<211> 204 

<212> DNA 

<213> Homo sapiens 



<400> 45 

gagtttaata gattaaaaac ttttgctaat tttccaagtg gtagtcctgt ttcagcatca 60 
acactggcac gagcagggtt tctttatact ggtgaaggag ataccgtgcg gtgctttagt 120 
tgtcatgcag ctgtagatag atggcaatat ggagactcag cagttggaag acacaggaaa 180 
gtatccccaa attgcagatt tatc 204 



<210> 46 

<211> 204 

<212> DNA 

<213> Homo sapiens 



<400> 46 

gaagaagcta gattaaagtc ctttcagaac tggccagact atgctcacct aaccccaaga 60 
gagttagcaa gtgctggact ctactacaca ggtattggtg accaagtgca gtgcttttgt 120 
tgtggtggaa aactgaaaaa ttgggaacct tgtgatcgtg cctggtcaga acacaggcga 180 
cactttccta attgcttctt tgtt 204 



<210> 47 

<211> 198 

<212> DNA 

<213> Homo sapiens 



<400> 47 

tatgaagcac ggatctttac ttttgggaca tggatatact cagttaacaa ggagcagctt 60 
gcaagagctg gattttatgc tttaggtgaa ggtgataaag taaagtgctt tcactgtgga 120 
ggagggctaa ctgattggaa gcccagtgaa gacccttggg aacaacatgc taaatggtat 180 
ccagggtgca aatatctg 198 



<210> 48 

<211> 138 

<212> DNA 

<213> Homo sapiens 



<400> 48 

gagcagctaa ggcgcctgca agaggagaag ctttgcaaaa tctgtatgga tagaaatatt 60 
gctatcgttt ttgttccttg tggacatcta gtcacttgta aacaatgtgc tgaagcagtt 120 
gacaagtgtc ccatgtgc 138 

<210> 49 

<211> 204 

<212> DNA 

<213> Mus musculus 



<400> 49 

gagtttaata gattaaaaac atttgctaac ttcccaagta gtagtcctgt ttcagcatca 60 
acattggcgc gagctgggtt tctttatacc ggtgaaggag acaccgtgca atgtttcagt 120 
tgtcatgcgg caatagatag atggcagtat ggagactcag ctgttggaag acacaggaga 180 
atatccccaa attgcagatt tatc 204 



<210> 50 

<211> 204 

<212> DNA 

<213> Mus musculus 



<400> 50 

gaagaagcca gattgaagtc atttcagaac tggccggact atgctcattt aacccccaga 60 
gagttagcta gtgctggcct ctactacaca ggggctgatg atcaagtgca atgcttttgt 120 
tgtgggggaa aactgaaaaa ttgggaaccc tgtgatcgtg cctggtcaga acacaggaga 180 
cactttccca attgcttttt tgtt 204 



<210> 51 
<211> 198 
<212> DNA 
<213> Mus musculus 
<400> 51 

tatgaagcac ggatcgttac ttttggaaca tggatatact cagttaacaa ggagcagctt 60 
gcaagagctg gattttatgc tttaggtgaa ggcgataaag tgaagtgctt ccactgtgga 120 
ggagggctca cggattggaa gccaagtgaa gacccctggg accagcatgc taagtgctac 180 
ccagggtgca aataccta 198 



<210> 52 

<211> 138 

<212> DNA 

<213> Mus musculus 



<400> 52 

gagcagctaa ggcgcctaca agaggagaag ctttccaaaa tctgtatgga tagaaatatt 60 
gctatcgttt tttttccttg tggacatctg gccacttgta aacagtgtgc agaagcagtt 120 
gacaaatgtc ccatgtgc 138 



<210> 53 

<211> 204 

<212> DNA 

<213> Homo sapiens 



<400> 53 

gaactgtacc gaatgtctac gtattccact tttcctgctg gggttcctgt ctcagaaagg 60 
agtcttgctc gtgctggttt ctattacact ggtgtgaatg acaaggtcaa atgcttctgt 120 
tgtggcctga tgctggataa ctggaaaaga ggagacagtc ctactgaaaa gcataaaaag 180 
ttgtatccta gctgcagatt cgtt 204 



<210> 54 

<211> 201 

<212> DNA 

<213> Homo sapiens 



<400> 54 

gaaaatgcca gattacttac ttttcagaca tggccattga cttttctgtc gccaacagat 60 
ctggcacgag caggctttta ctacatagga cctggagaca gagtggcttg ctttgcctgt 120 
ggtggaaaat tgagcaattg ggaaccgaag gataatgcta tgtcagaaca cctgagacat 180 
tttcccaaat gcccatttat a 201 



<210> 55 

<211> 204 

<212> DNA 

<213> Homo sapiens 



<400> 55 

catgcagccc gctttaaaac attctttaac tggccctcta gtgttctagt taatcctgag 60 
cagcttgcaa gtgcgggttt ttattatgtg ggtaacagtg atgatgtcaa atgcttttgc 120 
tgtgatggtg gactcaggtg ttgggaatct ggagatgatc catgggttca acatgccaag 180 
tggtttccaa ggtgtgagta cttg 204 



<210> 56 

<211> 138 

<212> DNA 

<213> Homo sapiens 



<400> 56 

gaacaattgc ggagactacc agaagaaaga acatgtaaag tgtgtatgga caaagaagtg 60 
tccatagtgt ttattccttg tggtcatcta gtagtatgca aagattgtgc tccttcttta 120 
agaaagtgtc ctatttgt 138 



<210> 57 

<211> 203 

<212> DNA 

<213> Mus musculus 



<400> 57 

agctgtaccg attgtccacg tattcagctt ttcccagggg agttcctgtg tcagaaagga 60 
gtctggctcg tgctggcttt tactacactg gtgccaatga caaggtcaag tgcttctgct 120 
gtggcctgat gctagacaac tggaaacaag gggacagtcc catggagaag cacagaaagt 180 
tgtaccccag ctgcaacttt gta 203 



<210> 58 

<211> 201 
<212> DNA 
<213> Mus musculus 

<400> 58 

gagaaggcca gattactcac ctatgaaaca tggccattgt cttttctgtc accagcaaag 60 
ctggccaaag caggcttcta ctacatagga cctggagata gagtggcctg ctttgcgtgc 120 
gatgggaaac tgagcaactg ggaacgtaag gatgatgcta tgtcagagca ccagaggcat 180 
ttccccagct gtccgttctt a 201 

<210> 59 
<211> 204 
<212> DNA 

<213> Mus musculus 

<400> 59 

cacgcagccc gtattagaac attctctaac tggccttcta gtgcactagt tcattcccag 60 
gaacttgcaa gtgcgggctt ttattataca ggacacagtg atgatgtcaa gtgtttatgc 120 
tgtgatggtg ggctgaggtg ctgggaatct ggagatgacc cctgggtgga acatgccaag 180 
tggtttccaa ggtgtgagta cttg 204 



<210> 60 

<211> 138 

<212> DNA 

<213> Mus musculus 



<400> 60 

gaacagttgc ggcccctccc ggaggacaga atgtgtaaag tgtgtatgga ccgagaggta 60 
tccatcgtgt tcattccctg tggccatctg gtcgtgtgca aagactgcgc tccctctctg 120 
aggaagtgtc ccatctgt 138 



<210> 61 

<211> 204 

<212> DNA 

<213> Homo sapiens 
<400> 61 

gaactctaca gaatgtctac atattcaact ttccccgccg gggtgcctgt ctcagaaagg 60 
agtcttgctc gtgctggttt ttattatact ggtgtgaatg acaaggtcaa atgcttctgt 120 
tgtggcctga tgctggataa ctggaaacta ggagacagtc ctattcaaaa gcataaacag 180 
ctatatccta gctgtagctt tatt 204 



<210> 62 

<211> 201 

<212> DNA 

<213> Homo sapiens 



<400> 62 

gaagaagcca gatttcttac ctaccatatg tggccattaa cttttttgtc accatcagaa 60 
ttggcaagag ctggttttta ttatatagga cctggagata gggtagcctg ctttgcctgt 120 
ggtgggaagc tcagtaactg ggaaccaaag gatgatgcta tgtcagaaca ccggaggcat 180 
tttcccaact gtccattttt g 201 



<210> 63 

<211> 204 

<212> DNA 

<213> Homo sapiens 



<400> 63 

catgcagctc gaatgagaac atttatgtac tggccatcta gtgttccagt tcagcctgag 60 
cagcttgcaa gtgctggttt ttattatgtg ggtcgcaatg atgatgtcaa atgctttggt 120 
tgtgatggtg gcttgaggtg ttgggaatct ggagatgatc catgggtaga acatgccaag 180 
tggtttccaa ggtgtgagtt cttg 204 



<210> 64 

<211> 138 

<212> DNA 

<213> Mus musculus 



<400> 64 

gaacaattga ggaggttgca agaagaacga acttgtaaag tgtgtatgga caaagaagtt 60 
tctgttgtat ttattccttg tggtcatctg gtagtatgcc aggaatgtgc cccttctcta 120 
agaaaatgcc ctatttgc 138 

<210> 65
<211> 204
<212> DNA
<213> Mus musculus

<400> 65

gaactctacc gaatgtctac atattcagct tttcccaggg gagttcctgt ctcagagagg 60
agtctggctc gtgctggctt ttattataca ggtgtgaatg acaaagtcaa gtgcttctgc 120
tgtggcctga tgttggataa ctggaaacaa ggggacagtc ctgttgaaaa gcacagacag 180
ttctatccca gctgcagctt tgta 204



<210> 66
<211> 201
<212> DNA
<213> Mus musculus

<400> 66

gaagaggcca gatttcttac ttacagtatg tggcctttaa gttttctgtc accagcagag 60
ctggccagag ctggcttcta ttacataggg cctggagaca gggtggcctg ttttgcctgt 120
ggtgggaaac tgagcaactg ggaaccaaag gattatgcta tgtcagagca ccgcagacat 180
tttccccact gtccatttct g 201

<210> 67
<211> 204
<212> DNA
<213> Mus musculus

<400> 67

cactctgctc gattgaggac atttctgtac tggccaccta gtgttcctgt tcagcccgag 60
cagcttgcaa gtgctggatt ctattacgtg gatcgcaatg atgatgtcaa gtgcctttgt 120
tgtgatggtg gcttgagatg ttgggaacct ggagatgacc cctggataga acacgccaaa 180
tggtttccaa ggtgtgagtt cttg 204

<210> 68
<211> 114
<212> DNA
<213> Mus musculus

<400> 68

gaacgaactt gcaaagtgtg tatggacaga gaggtttcta ttgtgttcat tccgtgtggt 60
catctagtag tctgccagga atgtgcccct tctctaagga agtgccccat ctgc 114

<210> 69
<211> 68
<212> PRT
<213> Homo sapiens

<400> 69

Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Gly Ser Pro
1 5 10 15

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu
20 25 30

Gly Asp Thr Val Arg Cys Phe Ser Cys His Ala Ala Val Asp Arg Trp
35 40 45

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Lys Val Ser Pro Asn
50 55 60

Cys Arg Phe Ile
65



<210> 70
<211> 68
<212> PRT
<213> Homo sapiens

<400> 70

Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His
1 5 10 15

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ile
20 25 30

Gly Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp
35 40 45

Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn
50 55 60

Cys Phe Phe Val
65



<210> 71
<211> 66
<212> PRT
<213> Homo sapiens

<400> 71

Tyr Glu Ala Arg Ile Phe Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn
1 5 10 15

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp
20 25 30

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro
35 40 45

Ser Glu Asp Pro Trp Glu Gln His Ala Lys Trp Tyr Pro Gly Cys Lys
50 55 60

Tyr Leu
65



<210> 72
<211> 46
<212> PRT
<213> Homo sapiens

<400> 72

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Cys Lys Ile Cys Met
1 5 10 15

Asp Arg Asn Ile Ala Ile Val Phe Val Pro Cys Gly His Leu Val Thr
20 25 30

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys
35 40 45



<210> 73
<211> 68
<212> PRT
<213> Mus musculus

<400> 73

Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Ser Ser Pro
1 5 10 15

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu
20 25 30

Gly Asp Thr Val Gln Cys Phe Ser Cys His Ala Ala Ile Asp Arg Trp
35 40 45

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Arg Ile Ser Pro Asn
50 55 60

Cys Arg Phe Ile
65

<210> 74
<211> 68
<212> PRT
<213> Mus musculus

<400> 74

Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His
1 5 10 15

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ala
20 25 30

Asp Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp
35 40 45

Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn
50 55 60

Cys Phe Phe Val
65

<210> 75
<211> 66
<212> PRT
<213> Mus musculus

<400> 75

Tyr Glu Ala Arg Ile Val Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn
1 5 10 15

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp
20 25 30

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro
35 40 45

Ser Glu Asp Pro Trp Asp Gln His Ala Lys Cys Tyr Pro Gly Cys Lys
50 55 60

Tyr Leu
65

<210> 76
<211> 46
<212> PRT
<213> Mus musculus

<400> 76

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Ser Lys Ile Cys Met
1 5 10 15

Asp Arg Asn Ile Ala Ile Val Phe Phe Pro Cys Gly His Leu Ala Thr
20 25 30

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys
35 40 45

<210> 77
<211> 68
<212> PRT
<213> Homo sapiens

<400> 77

Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val
20 25 30

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp
35 40 45

Lys Arg Gly Asp Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser
50 55 60

Cys Arg Phe Val
65

<210> 78
<211> 67
<212> PRT
<213> Homo sapiens

<400> 78

Glu Asn Ala Arg Leu Leu Thr Phe Gln Thr Trp Pro Leu Thr Phe Leu
1 5 10 15

Ser Pro Thr Asp Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu
35 40 45

Pro Lys Asp Asn Ala Met Ser Glu His Leu Arg His Phe Pro Lys Cys
50 55 60

Pro Phe Ile
65



<210> 79
<211> 68
<212> PRT
<213> Homo sapiens

<400> 79

His Ala Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu
1 5 10 15

Val Asn Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn
20 25 30

Ser Asp Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp
35 40 45

Glu Ser Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg
50 55 60

Cys Glu Tyr Leu
65

<210> 80
<211> 46
<212> PRT
<213> Homo sapiens

<400> 80

Glu Gln Leu Arg Arg Leu Pro Glu Glu Arg Thr Cys Lys Val Cys Met
1 5 10 15

Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val
20 25 30

Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys
35 40 45



<210> 81
<211> 68
<212> PRT
<213> Mus musculus

<400> 81

Glu Leu Tyr Arg Leu Ser Thr Tyr Ser Ala Phe Pro Arg Gly Val Pro
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Ala
20 25 30

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp
35 40 45

Lys Gln Gly Asp Ser Pro Met Glu Lys His Arg Lys Leu Tyr Pro Ser
50 55 60

Cys Asn Phe Val
65

<210> 82
<211> 67
<212> PRT
<213> Mus musculus

<400> 82

Glu Lys Ala Arg Leu Leu Thr Tyr Glu Thr Trp Pro Leu Ser Phe Leu
1 5 10 15

Ser Pro Ala Lys Leu Ala Lys Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30

Asp Arg Val Ala Cys Phe Ala Cys Asp Gly Lys Leu Ser Asn Trp Glu
35 40 45

Arg Lys Asp Asp Ala Met Ser Glu His Gln Arg His Phe Pro Ser Cys
50 55 60

Pro Phe Leu
65



<210> 83
<211> 68
<212> PRT
<213> Mus musculus

<400> 83

His Ala Ala Arg Ile Arg Thr Phe Ser Asn Trp Pro Ser Ser Ala Leu
1 5 10 15

Val His Ser Gln Glu Leu Ala Ser Ala Gly Phe Tyr Tyr Thr Gly His
20 25 30

Ser Asp Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp
35 40 45

Glu Ser Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg
50 55 60

Cys Glu Tyr Leu
65



<210> 84
<211> 46
<212> PRT
<213> Mus musculus

<400> 84

Glu Gln Leu Arg Pro Leu Pro Glu Asp Arg Met Cys Lys Val Cys Met
1 5 10 15

Asp Arg Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val
20 25 30

Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys
35 40 45



<210> 85
<211> 68
<212> PRT
<213> Homo sapiens

<400> 85

Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val
20 25 30

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp
35 40 45

Lys Leu Gly Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser
50 55 60

Cys Ser Phe Ile
65



<210> 86
<211> 67
<212> PRT
<213> Homo sapiens

<400> 86

Glu Glu Ala Arg Phe Leu Thr Tyr His Met Trp Pro Leu Thr Phe Leu
1 5 10 15

Ser Pro Ser Glu Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu
35 40 45

Pro Lys Asp Asp Ala Met Ser Glu His Arg Arg His Phe Pro Asn Cys
50 55 60

Pro Phe Leu
65



<210> 87
<211> 68
<212> PRT
<213> Homo sapiens

<400> 87

His Ala Ala Arg Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro
1 5 10 15

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg
20 25 30

Asn Asp Asp Val Lys Cys Phe Gly Cys Asp Gly Gly Leu Arg Cys Trp
35 40 45

Glu Ser Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg
50 55 60

Cys Glu Phe Leu
65



<210> 88
<211> 46
<212> PRT
<213> Homo sapiens

<400> 88

Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met
1 5 10 15

Asp Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val
20 25 30

Cys Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys
35 40 45



<210> 89
<211> 68
<212> PRT
<213> Mus musculus

<400> 89

Glu Leu Tyr Arg Met Ser Thr Tyr Ser Ala Phe Pro Arg Gly Val Pro
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val
20 25 30

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp
35 40 45

Lys Gln Gly Asp Ser Pro Val Glu Lys His Arg Gln Phe Tyr Pro Ser
50 55 60

Cys Ser Phe Val
65



<210> 90
<211> 67
<212> PRT
<213> Mus musculus

<400> 90

Glu Glu Ala Arg Phe Leu Thr Tyr Ser Met Trp Pro Leu Ser Phe Leu
1 5 10 15

Ser Pro Ala Glu Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu
35 40 45

Pro Lys Asp Tyr Ala Met Ser Glu His Arg Arg His Phe Pro His Cys
50 55 60

Pro Phe Leu
65



<210> 91
<211> 68
<212> PRT
<213> Mus musculus

<400> 91

His Ser Ala Arg Leu Arg Thr Phe Leu Tyr Trp Pro Pro Ser Val Pro
1 5 10 15

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Asp Arg
20 25 30

Asn Asp Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp
35 40 45

Glu Pro Gly Asp Asp Pro Trp Ile Glu His Ala Lys Trp Phe Pro Arg
50 55 60

Cys Glu Phe Leu
65



<210> 92
<211> 38
<212> PRT
<213> Mus musculus

<400> 92

Glu Arg Thr Cys Lys Val Cys Met Asp Arg Glu Val Ser Ile Val Phe
1 5 10 15

Ile Pro Cys Gly His Leu Val Val Cys Gln Glu Cys Ala Pro Ser Leu
20 25 30

Arg Lys Cys Pro Ile Cys
35



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Korneluk, Robert G.
Mackenzie, Alexander E.
Baird, Stephen

(ii) TITLE OF INVENTION: MAMMALIAN IAP GENE FAMILY, PRIMERS,
PROBES, AND DETECTION METHODS

(iii) NUMBER OF SEQUENCES: 42 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/511,485 

(B) FILING DATE: 04-AUG-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Clark, Paul T.

(B) REGISTRATION NUMBER: 30,162

(C) REFERENCE/DOCKET NUMBER: 07891/002001

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE:

(D) OTHER INFORMATION: Xaa at positions 2, 3, 4, 5,
6, 7, 9, 10, 11, 17, 18, 19, 20, 21, 23, 25, 30, 31, 32, 34, 35,
38, 39, 40, 41, 42, and 45 may be any amino acid. Xaa at position
8 is Glu or Asp. Xaa at positions 14 & 22 is Val or Ile.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Lys Xaa Cys Met
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Pro Cys Gly His Xaa Xaa Xaa 
20 25 30 

Cys Xaa Xaa Cys Ala Xaa Xaa Xaa Xaa Xaa Cys Pro Xaa Cys 
35 40 45 
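
The FEATURE entry for SEQ ID NO:1 describes a consensus pattern: fixed residues at some positions, a restricted choice at positions 8, 14 and 22, and any amino acid at the remaining Xaa positions. The sketch below is illustrative only and not part of the listing. It turns that description into a regular expression and checks one candidate against it. The function names are hypothetical, the candidate is the 46-residue sequence given under <210> 72, and "any amino acid" is approximated by any capital letter.

```python
import re

# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# SEQ ID NO:1 as listed (46 residues, Xaa marks a variable position).
SEQ_ID_1 = (
    "Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Lys Xaa Cys Met "
    "Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Pro Cys Gly His Xaa Xaa Xaa "
    "Cys Xaa Xaa Cys Ala Xaa Xaa Xaa Xaa Xaa Cys Pro Xaa Cys"
)

# Restricted Xaa positions (1-based), taken from the FEATURE entry:
# position 8 is Glu or Asp; positions 14 and 22 are Val or Ile.
RESTRICTED = {8: "ED", 14: "VI", 22: "VI"}

# Candidate: the 46-residue sequence listed under <210> 72.
SEQ_72 = (
    "Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Cys Lys Ile Cys Met "
    "Asp Arg Asn Ile Ala Ile Val Phe Val Pro Cys Gly His Leu Val Thr "
    "Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys"
)


def to_one_letter(listing):
    """Convert a space-separated three-letter listing to one-letter codes."""
    return "".join(THREE_TO_ONE[token] for token in listing.split())


def consensus_to_regex(listing, restricted):
    """Build a regex from a listing in which Xaa marks variable positions."""
    parts = []
    for pos, token in enumerate(listing.split(), start=1):
        if token != "Xaa":
            parts.append(THREE_TO_ONE[token])          # fixed residue
        elif pos in restricted:
            parts.append("[" + restricted[pos] + "]")  # restricted choice
        else:
            parts.append("[A-Z]")  # assumption: any letter stands for any residue
    return re.compile("".join(parts))


def matches_consensus(one_letter_seq):
    """Return True if the whole sequence fits the SEQ ID NO:1 pattern."""
    pattern = consensus_to_regex(SEQ_ID_1, RESTRICTED)
    return pattern.fullmatch(one_letter_seq) is not None


if __name__ == "__main__":
    print(matches_consensus(to_one_letter(SEQ_72)))
```

Using `fullmatch` rather than `search` means the candidate must be exactly 46 residues long, which mirrors the fixed length stated for SEQ ID NO:1.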

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa at positions 1, 2, 3,
6, 9, 10, 14, 15, 18, 19, 20, 21, 24, 30, 32, 33, 35, 37, 40,
42, 43, 44, 45, 46, 47, 49, 50, 51, 53, 54, 55, 56, 57, 59, 60,
61, 62, 64 and 66 may be any amino acid. Xaa at positions 13, 16 and 17 may
be any amino acid or may be absent.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Xaa Xaa Xaa Arg Leu Xaa Thr Phe Xaa Xaa Trp Pro Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Leu Ala Xaa Ala Gly Phe Tyr Tyr Xaa Gly Xaa
20 25 30

Xaa Asp Xaa Val Xaa Cys Phe Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Trp
35 40 45

Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Pro Xaa
50 55 60

Cys Xaa Phe Val
65



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GAAAAGGTGG ACAAGTCCTA TTTTGAAGAG AAGATGACTT TTAACAGTTT TGAAGGATCT 60 

AAAACTTGTG TACCTGCAGA CATCAATAAG GAAGAAGAAT TTGTAGAAGA GTTTAATAGA 120 

TTAAAAACTT TTGCTAATTT TCCAAGTGGT AGTCCTGTTT CAGCATCAAC ACTGGCACGA 180 

GCAGGGTTTC TTTATACTGG TGAAGGAGAT ACCGTGCGGT GCTTTAGTTG TCATGCAGCT 240 

GTAGATAGAT GGCAATATGG AGACTCAGCA GTTGGAAGAC ACAGGAAAGT ATCCCCAAAT 300 

TGCAGATTTA TCAACGGCTT TTATCTTGAA AATAGTGCCA CGCAGTCTAC AAATTCTGGT 360 

ATCCAGAATG GTCAGTACAA AGTTGAAAAC TATCTGGGAA GCAGAGATCA TTTTGCCTTA 420 

GACAGGCCAT CTGAGACACA TGCAGACTAT CTTTTGAGAA CTGGGCAGGT TGTAGATATA 480 

TCAGACACCA TATACCCGAG GAACCCTGCC ATGTATTGTG AAGAAGCTAG ATTAAAGTCC 540 

TTTCAGAACT GGCCAGACTA TGCTCACCTA ACCCCAAGAG AGTTAGCAAG TGCTGGACTC 600 

TACTACACAG GTATTGGTGA CCAAGTGCAG TGCTTTTGTT GTGGTGGAAA ACTGAAAAAT 660 

TGGGAACCTT GTGATCGTGC CTGGTCAGAA CACAGGCGAC ACTTTCCTAA TTGCTTCTTT 720 

GTTTTGGGCC GGAATCTTAA TATTCGAAGT GAATCTGATG CTGTGAGTTC TGATAGGAAT 780 

TTCCCAAATT CAACAAATCT TCCAAGAAAT CCATCCATGG CAGATTATGA AGCACGGATC 840 

TTTACTTTTG GGACATGGAT ATACTCAGTT AACAAGGAGC AGCTTGCAAG AGCTGGATTT 900 

TATGCTTTAG GTGAAGGTGA TAAAGTAAAG TGCTTTCACT GTGGAGGAGG GCTAACTGAT 960 

TGGAAGCCCA GTGAAGACCC TTGGGAACAA CATGCTAAAT GGTATCCAGG GTGCAAATAT 1020 

CTGTTAGAAC AGAAGGGACA AGAATATATA AACAATATTC ATTTAACTCA TTCACTTGAG 1080 

GAGTGTCTGG TAAGAACTAC TGAGAAAACA CCATCACTAA CTAGAAGAAT TGATGATACC 1140 

ATCTTCCAAA ATCCTATGGT ACAAGAAGCT ATACGAATGG GGTTCAGTTT CAAGGACATT 1200 



AAGAAAATAA TGGAGGAAAA AATTCAGATA TCTGGGAGCA ACTATAAATC ACTTGAGGTT 1260

CTGGTTGCAG ATCTAGTGAA TGCTCAGAAA GACAGTATGC AAGATGAGTC AAGTCAGACT 1320

TCATTACAGA AAGAGATTAG TACTGAAGAG CAGCTAAGGC GCCTGCAAGA GGAGAAGCTT 1380

TGCAAAATCT GTATGGATAG AAATATTGCT ATCGTTTTTG TTCCTTGTGG ACATCTAGTC 1440

ACTTGTAAAC AATGTGCTGA AGCAGTTGAC AAGTGTCCCA TGTGCTACAC AGTCATTACT 1500

TTCAAGCAAA AAATTTTTAT GTCTTAATCT AACTCTATAG TAGGCATGTT ATGTTGTTCT 1560

TATTACCCTG ATTGAATGTG TGATGTGAAC TGACTTTAAG TAATCAGGAT TGAATTCCAT 1620

TAGCATTTGC TACCAAGTAG GAAAAAAAAT GTACATGGCA GTGTTTTAGT TGGCAATATA 1680

ATCTTTGAAT TTCTTGATTT TTCAGGGTAT TAGCTGTATT ATCCATTTTT TTTACTGTTA 1740

TTTAATTGAA ACCATAGACT AAGAATAAGA AGCATCATAC TATAACTGAA CACAATGTGT 1800

ATTCATAGTA TACTGATTTA ATTTCTAAGT GTAAGTGAAT TAATCATCTG GATTTTTTAT 1860 

TCTTTTCAGA TAGGCTTAAC AAATGGAGCT TTCTGTATAT AAATGTGGAG ATTAGAGTTA 1920 

ATCTCCCCAA TCACATAATT TGTTTTGTGT GAAAAAGGAA TAAATTGTTC CATGCTGGTG 1980 

GAAAGATAGA GATTGTTTTT AGAGGTTGGT TGTTGTGTTT TAGGATTCTG TCCATTTTCT 2040 

TGTAAAGGGA TAAACACGGA CGTGTGCGAA ATATGTTTGT AAAGTGATTT GCCATTGTTG 2100 

AAAGCGTATT TAATGATAGA ATACTATCGA GCCAACATGT ACTGACATGG AAAGATGTCA 2160 

GAGATATGTT AAGTGTAAAA TGCAAGTGGC GGGACACTAT GTATAGTCTG AGCCAGATCA 2220 

AAGTATGTAT GTTGTTAATA TGCATAGAAC GAGAGATTTG GAAAGATATA CACCAAACTG 2280 

TTAAATGTGG TTTCTCTTCG GGGhGGGGGG GATTGGGGGA GGGGCCCCAG AGGGGTTTTA 2340 

GAGGGGCCTT TTCACTTTCG ACTTTTTTCA TTTTGTTCTG TTCGGATTTT TTATAAGTAT 2400 

GTAGACCCCG AAGGGTTTTA TGGGAACTAA CATCAGTAAC CTAACCCCCG TGACTATCCT 2460 

GTGCTCTTCC TAGGGAGCTG TGTTGTTTCC CACCCACCAC CCTTCCCTCT GAACAAATGC 2520 

CTGAGTGCTG GGGCACTTTN 2540 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Phe Asn Ser Phe Glu Gly Ser Lys Thr Cys Val Pro Ala Asp
1 5 10 15

Ile Asn Lys Glu Glu Glu Phe Val Glu Glu Phe Asn Arg Leu Lys Thr
20 25 30

Phe Ala Asn Phe Pro Ser Gly Ser Pro Val Ser Ala Ser Thr Leu Ala
35 40 45

Arg Ala Gly Phe Leu Tyr Thr Gly Glu Gly Asp Thr Val Arg Cys Phe
50 55 60

Ser Cys His Ala Ala Val Asp Arg Trp Gln Tyr Gly Asp Ser Ala Val
65 70 75 80

Gly Arg His Arg Lys Val Ser Pro Asn Cys Arg Phe Ile Asn Gly Phe
85 90 95

Tyr Leu Glu Asn Ser Ala Thr Gln Ser Thr Asn Ser Gly Ile Gln Asn
100 105 110

Gly Gln Tyr Lys Val Glu Asn Tyr Leu Gly Ser Arg Asp His Phe Ala
115 120 125

Leu Asp Arg Pro Ser Glu Thr His Ala Asp Tyr Leu Leu Arg Thr Gly
130 135 140

Gln Val Val Asp Ile Ser Asp Thr Ile Tyr Pro Arg Asn Pro Ala Met
145 150 155 160

Tyr Cys Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr
165 170 175

Ala His Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr
180 185 190

Gly Ile Gly Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys
195 200 205

Asn Trp Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe
210 215 220

Pro Asn Cys Phe Phe Val Leu Gly Arg Asn Leu Asn Ile Arg Ser Glu
225 230 235 240

Ser Asp Ala Val Ser Ser Asp Arg Asn Phe Pro Asn Ser Thr Asn Leu
245 250 255

Pro Arg Asn Pro Ser Met Ala Asp Tyr Glu Ala Arg Ile Phe Thr Phe
260 265 270

Gly Thr Trp Ile Tyr Ser Val Asn Lys Glu Gln Leu Ala Arg Ala Gly
275 280 285

Phe Tyr Ala Leu Gly Glu Gly Asp Lys Val Lys Cys Phe His Cys Gly
290 295 300

Gly Gly Leu Thr Asp Trp Lys Pro Ser Glu Asp Pro Trp Glu Gln His
305 310 315 320

Ala Lys Trp Tyr Pro Gly Cys Lys Tyr Leu Leu Glu Gln Lys Gly Gln
325 330 335

Glu Tyr Ile Asn Asn Ile His Leu Thr His Ser Leu Glu Glu Cys Leu
340 345 350

Val Arg Thr Thr Glu Lys Thr Pro Ser Leu Thr Arg Arg Ile Asp Asp
355 360 365

Thr Ile Phe Gln Asn Pro Met Val Gln Glu Ala Ile Arg Met Gly Phe
370 375 380

Ser Phe Lys Asp Ile Lys Lys Ile Met Glu Glu Lys Ile Gln Ile Ser
385 390 395 400

Gly Ser Asn Tyr Lys Ser Leu Glu Val Leu Val Ala Asp Leu Val Asn
405 410 415

Ala Gln Lys Asp Ser Met Gln Asp Glu Ser Ser Gln Thr Ser Leu Gln
420 425 430

Lys Glu Ile Ser Thr Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys
435 440 445

Leu Cys Lys Ile Cys Met Asp Arg Asn Ile Ala Ile Val Phe Val Pro
450 455 460

Cys Gly His Leu Val Thr Cys Lys Gln Cys Ala Glu Ala Val Asp Lys
465 470 475 480

Cys Pro Met Cys Tyr Thr Val Ile Thr Phe Lys Gln Lys Ile Phe Met
485 490 495

Ser

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TCCTTGAGAT GTATCAGTAT AGGATTTAGG ATCTCCATGT TGGAACTCTA AATGCATAGA 60

AATGGAAATA ATGGAAATTT TTCATTTTGG CTTTTCAGCC TAGTATTAAA ACTGATAAAA 120

GCAAAGCCAT GCACAAAACT ACCTCCCTAG AGAAAGGCTA GTCCCTTTTC TTCCCCATTC 180

ATTTCATTAT GAACATAGTA GAAAACAGCA TATTCTTATC AAATTTGATG AAAAGCGCCA 240

ACACGTTTGA ACTGAAATAC GACTTGTCAT GTGAACTGTA CCGAATGTCT ACGTATTCCA 300

CTTTTCCTGC TGGGGTTCCT GTCTCAGAAA GGAGTCTTGC TCGTGCTGGT TTCTATTACA 360

CTGGTGTGAA TGACAAGGTC AAATGCTTCT GTTGTGGCCT GATGCTGGAT AACTGGAAAA 420

GAGGAGACAG TCCTACTGAA AAGCATAAAA AGTTGTATCC TAGCTGCAGA TTCGTTCAGA 480

GTCTAAATTC CGTTAACAAC TTGGAAGCTA CCTCTCAGCC TACTTTTCCT TCTTCAGTAA 540

CACATTCCAC ACACTCATTA CTTCCGGGTA CAGAAAACAG TGGATATTTC CGTGGCTCTT 600

ATTCAAACTC TCCATCAAAT CCTGTAAACT CCAGAGCAAA TCAAGAATTT TCTGCCTTGA 660

TGAGAAGTTC CTACCCCTGT CCAATGAATA ACGAAAATGC CAGATTACTT ACTTTTCAGA 720

CATGGCCATT GACTTTTCTG TCGCCAACAG ATCTGGCACG AGCAGGCTTT TACTACATAG 780

GACCTGGAGA CAGAGTGGCT TGCTTTGCCT GTGGTGGAAA ATTGAGCAAT TGGGAACCGA 840

AGGATAATGC TATGTCAGAA CACCTGAGAC ATTTTCCCAA ATGCCCATTT ATAGAAAATC 900

AGCTTCAAGA CACTTCAAGA TACACAGTTT CTAATCTGAG CATGCAGACA CATGCAGCCC 960

GCTTTAAAAC ATTCTTTAAC TGGCCCTCTA GTGTTCTAGT TAATCCTGAG CAGCTTGCAA 1020

GTGCGGGTTT TTATTATGTG GGTAACAGTG ATGATGTCAA ATGCTTTTGC TGTGATGGTG 1080

GACTCAGGTG TTGGGAATCT GGAGATGATC CATGGGTTCA ACATGCCAAG TGGTTTCCAA 1140

GGTGTGAGTA CTTGATAAGA ATTAAAGGAC AGGAGTTCAT CCGTCAAGTT CAAGCCAGTT 1200

ACCCTCATCT ACTTGAACAG CTGCTATCCA CATCAGACAG CCCAGGAGAT GAAAATGCAG 1260

AGTCATCAAT TATCCATTTG GAACCTGGAG AAGACCATTC AGAAGATGCA ATCATGATGA 1320

ATACTCCTGT GATTAATGCT GCCGTGGAAA TGGGCTTTAG TAGAAGCCTG GTAAAACAGA 1380

CAGTTCAGAG AAAAATCCTA GCAACTGGAG AGAATTATAG ACTAGTCAAT GATCTTGTGT 1440

TAGACTTACT CAATGCAGAA GATGAAATAA GGGAAGAGGA GAGAGAAAGA GCAACTGAGG 1500

AAAAAGAATC AAATGATTTA TTATTAATCC GGAAGAATAG AATGGCACTT TTTCAACATT 1560

TGACTTGTGT AATTCCAATC CTGGATAGTC TACTAACTGC CGGAATTATT AATGAACAAG 1620

AACATGATGT TATTAAACAG AAGACACAGA CGTCTTTACA AGCAAGAGAA CTGATTGATA 1680

CGATTTTAGT AAAAGGAAAT ATTGCAGCCA CTGTATTCAG AAACTCTCTG CAAGAAGCTG 1740

AAGCTGTGTT ATATGAGCAT TTATTTGTGC AACAGGACAT AAAATATATT CCCACAGAAG 1800

ATGTTTCAGA TCTACCAGTG GAAGAACAAT TGCGGAGACT ACCAGAAGAA AGAACATGTA 1860

AAGTGTGTAT GGACAAAGAA GTGTCCATAG TGTTTATTCC TTGTGGTCAT CTAGTAGTAT 1920

GCAAAGATTG TGCTCCTTCT TTAAGAAAGT GTCCTATTTG TAGGAGTACA ATCAAGGGTA 1980

CAGTTCGTAC ATTTCTTTCA TGAAGAAGAA CCAAAACATC GTCTAAACTT TAGAATTAAT 2040

TTATTAAATG TATTATAACT TTAACTTTTA TCCTAATTTG GTTTCCTTAA AATTTTTATT 2100

TATTTACAAC TCAAAAAACA TTGTTTTGTG TAACATATTT ATATATGTAT CTAAACCATA 2160

TGAACATATA TTTTTTAGAA ACTAAGAGAA TGATAGGCTT TTGTTCTTAT GAACGAAAAA 2220

GAGGTAGCAC TACAAACACA ATATTCAATC CAAATTTCAG CATTATTGAA ATTGTAAGTG 2280

AAGTAAAACT TAAGATATTT GAGTTAACCT TTAAGAATTT TAAATATTTT GGCATTGTAC 2340

TAATACCGGG AACATGAAGC CAGGXGTGGT GGTATGTACC TGTAGTCCCA GGCTGAGGCA 2400

AGAGAATTAC TTGAGCCCAG GAGTTTGAAT CCATCCTGGG CAGCATACTG AGACCCTGCC 2460

TTTAAAAACN AACAGNACCA AANCCAAACA CCAGGGACAC ATTTCTCTGT CTTTTTTGAT 2520

CAGTGTCCTA TACATCGAAG GTGTGCATAT ATGTTGAATC ACATTTTAGG GACATGGTGT 2580

TTTTATAAAG AATTCTGTGA GNAAAAATTT AATAAAGCAA CCAAATTACT CTTAAAAAAA 2640

AAAAAAAAAA AAAAAACTCG AGGGGCCCGT ACCAAT 2676



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Ile Val Glu Asn Ser Ile Phe Leu Ser Asn Leu Met Lys Ser
1 5 10 15

Ala Asn Thr Phe Glu Leu Lys Tyr Asp Leu Ser Cys Glu Leu Tyr Arg
20 25 30

Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu Arg
35 40 45

Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys Val
50 55 60

Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Arg Gly Asp
65 70 75 80

Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser Cys Arg Phe Val
85 90 95

Gln Ser Leu Asn Ser Val Asn Asn Leu Glu Ala Thr Ser Gln Pro Thr
100 105 110

Phe Pro Ser Ser Val Thr His Ser Thr His Ser Leu Leu Pro Gly Thr
115 120 125

Glu Asn Ser Gly Tyr Phe Arg Gly Ser Tyr Ser Asn Ser Pro Ser Asn
130 135 140

Pro Val Asn Ser Arg Ala Asn Gln Glu Phe Ser Ala Leu Met Arg Ser
145 150 155 160

Ser Tyr Pro Cys Pro Met Asn Asn Glu Asn Ala Arg Leu Leu Thr Phe
165 170 175

Gln Thr Trp Pro Leu Thr Phe Leu Ser Pro Thr Asp Leu Ala Arg Ala
180 185 190

Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys
195 200 205

Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asn Ala Met Ser Glu
210 215 220

His Leu Arg His Phe Pro Lys Cys Pro Phe Ile Glu Asn Gln Leu Gln
225 230 235 240

Asp Thr Ser Arg Tyr Thr Val Ser Asn Leu Ser Met Gln Thr His Ala
245 250 255

Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu Val Asn
260 265 270

Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn Ser Asp
275 280 285

Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser
290 295 300

Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg Cys Glu
305 310 315 320

Tyr Leu Ile Arg Ile Lys Gly Gln Glu Phe Ile Arg Gln Val Gln Ala
325 330 335

Ser Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Ser Pro
340 345 350

Gly Asp Glu Asn Ala Glu Ser Ser Ile Ile His Leu Glu Pro Gly Glu
355 360 365

Asp His Ser Glu Asp Ala Ile Met Met Asn Thr Pro Val Ile Asn Ala
370 375 380

Ala Val Glu Met Gly Phe Ser Arg Ser Leu Val Lys Gln Thr Val Gln
385 390 395 400

Arg Lys Ile Leu Ala Thr Gly Glu Asn Tyr Arg Leu Val Asn Asp Leu
405 410 415

Val Leu Asp Leu Leu Asn Ala Glu Asp Glu Ile Arg Glu Glu Glu Arg
420 425 430

Glu Arg Ala Thr Glu Glu Lys Glu Ser Asn Asp Leu Leu Leu Ile Arg
435 440 445

Lys Asn Arg Met Ala Leu Phe Gln His Leu Thr Cys Val Ile Pro Ile
450 455 460

Leu Asp Ser Leu Leu Thr Ala Gly Ile Ile Asn Glu Gln Glu His Asp
465 470 475 480

Val Ile Lys Gln Lys Thr Gln Thr Ser Leu Gln Ala Arg Glu Leu Ile
485 490 495

Asp Thr Ile Leu Val Lys Gly Asn Ile Ala Ala Thr Val Phe Arg Asn
500 505 510

Ser Leu Gln Glu Ala Glu Ala Val Leu Tyr Glu His Leu Phe Val Gln
515 520 525

Gln Asp Ile Lys Tyr Ile Pro Thr Glu Asp Val Ser Asp Leu Pro Val
530 535 540

Glu Glu Gln Leu Arg Arg Leu Pro Glu Glu Arg Thr Cys Lys Val Cys
545 550 555 560

Met Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val
565 570 575

Val Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg
580 585 590

Ser Thr Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
595 600

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TTAGGTTACC TGAAAGAGTT ACTACAACCC C AAAG Air 1 1 <j TP-TTPT A Af^T AGTATCTTGG 60

TAATTCAGAG AGATACTCAT CCTACCTGAA TATAAACTGA GA1 AAA i GOA CTAAARAAAG 120

TGTAGTAAAT TCTACATAAG AGTCTATCAT TGATTTCTTT TTG I GG 1 GGA A ATPTTAP.TT 180

CATGTGAAGA AATTTCATGT GAATGTTTTA GCTATCAAAC AGTACTGTCA CO 1 1 OA X 240

CACAAAACTG CCTCCCAAAG ACTTTTCCCA GGTCCCTCGT ATC AAAACA 1 T A If ACT AT A 300

ATGGAAGATA GCACGATCTT GTCAGATTGG ACAAACAGCA ACAAACAAAA 7A ATT 1 A AflTAT 360

GACTTTTCCT GTGAACTCTA CAGAATGTCT ACATATTCAA CTTTCCGGGC OGGVjJj X V? OO X 420

GTCTCAGAAA GGAGTCTTGC TCGTGCTGGT TTTTATTATA CTGG I G I GAA TP A p A AcnTP 480

AAATGCTTCT GTTGTGGCCT GATGCTGGAT AACTGGAAAC TAGGAGAGAv TPrTATTPAA 540

AAGCATAAAC AGCTATATCC TAGCTGTAGC TTTATTCAGA A 1 G X GG 1 1 1 G A fiPT AfiTPTCJ 600

GGATCCACCT CTAAGAATAC GTCTCCAATG AGAAACAGTT TTGCAGAx 1G ATT ATCTrPP 660

ACCTTGGAAC ATAGTAGCTT GTTCAGTGGT TCTTACTCCA GCCTTCCT CG AAAOOO lv/ll 720

AATTCTAGAG CAGTTGAAGA CATCTCTTCA TCGAGGACTA AGGGG 1 AGAG TTATflP A ATfl 780


AGTACTGAAG AAGCCAGATT TCTTACCTAC CATATGTGGC CATTAACTTT TTTGTCACCA 840

TCAGAATTGG CAAGAGCTGG TTTTTATTAT ATAGGACCTG GAGATAGGGT AGCCTGCTTT 900

GCCTGTGGTG GGAAGCTCAG TAACTGGGAA CCAAAGGATG ATGCTATGTC AGAACACCGG 960

AGGCATTTTC CCAACTGTCC ATTTTTGGAA AATTCTCTAG AAACTCTGAG GTTTAGCATT 1020

TCAAATCTGA GCATGCAGAC ACATGCAGCT CGAATGAGAA CATTTATGTA CTGGCCATCT 1080

AGTGTTCCAG TTCAGCCTGA GCAGCTTGCA AGTGCTGGTT TTTATTATGT GGGTCGCAAT 1140

GATGATGTCA AATGCTTTGG TTGTGATGGT GGCTTGAGGT GTTGGGAATC TGGAGATGAT 1200

CCATGGGTAG AACATGCCAA GTGGTTTCCA AGGTGTGAGT TCTTGATACG AATGAAAGGC 1260

CAAGAGTTTG TTGATGAGAT TCAAGGTAGA TATCCTCATC TTCTTGAACA GCTGTTGTCA 1320

ACTTCAGATA CCACTGGAGA AGAAAATGCT GACCCACCAA TTATTCATTT TGGACCTGGA 1380

GAAAGTTCTT CAGAAGATGC TGTCATGATG AATACACCTG TGGTTAAATC TGCCTTGGAA 1440

ATGGGCTTTA ATAGAGACCT GGTGAAACAA ACAGTTCTAA GTAAAATCCT GACAACTGGA 1500

GAGAACTATA AAACAGTTAA TGATATTGTG TCAGCACTTC TTAATGCTGA AGATGAAAAA 1560

AGAGAAGAGG AGAAGGAAAA ACAAGCTGAA GAAATGGCAT CAGATGATTT GTCATTAATT 1620

CGGAAGAACA GAATGGCTCT CTTTCAACAA TTGACATGTG TGCTTCCTAT CCTGGATAAT 1680

CTTTTAAAGG CCAATGTAAT TAATAAACAG GAACATGATA TTATTAAACA AAAAACACAG 1740

ATACCTTTAC AAGCGAGAGA ACTGATTGAT ACCATTTGGG TTAAAGGAAA TGCTGCGGCC 1800

AACATCTTCA AAAACTGTCT AAAAGAAATT GACTCTACAT TGTATAAGAA CTTATTTGTG 1860

GATAAGAATA TGAAGTATAT TCCAACAGAA GATGTTTCAG GTCTGTCACT GGAAGAACAA 1920

TTGAGGAGGT TGCAAGAAGA ACGAACTTGT AAAGTGTGTA TGGACAAAGA AGTTTCTGTT 1980

GTATTTATTC CTTGTGGTCA TCTGGTAGTA TGCCAGGAAT GTGCCCCTTC TCTAAGAAAA 2040

TGCCCTATTT GCAGGGGTAT AATCAAGGGT ACTGTTCGTA CATTTCTCTC TTAAAGAAAA 2100

ATAGTCTATA TTTTAACCTG CATAAAAAGG TCTTTAAAAT ATTGTTGAAC ACTTGAAGCC 2160

ATCTAAAGTA AAAAGGGAAT TATGAGTTTT TCAATTAGTA ACATTCATGT TCTAGTCTGC 2220

TTTGGTACTA ATAATCTTGT TTCTGAAAAG ATGGTATCAT ATATTTAATC TTAATCTGTT 2280

TATTTACAAG GGAAGATTTA TGTTTGGTGA ACTATATTAG TATGTATGTG TACCTAAGGG 2340

AGTAGCGTCN CTGCTTGTTA TGCATCATTT CAGGAGTTAC TGGATTTGTT GTTCTTTCAG 2400

AAAGCTTTGA ANACTAAATT ATAGTGTAGA AAAGAACTGG AAACCAGGAA CTCTGGAGTT 2460

CATCAGAGTT ATGGTGCCGA ATTGTCTTTG GTGCTTTTCA CTTGTGTTTT AAAATAAGGA 2520


TTTTTCTCTT 


ATTTCTCCCC 


CTAGTTTGTG 


AGAAACATCT 


CAATAAAGTG 


CTTTAAAAAG 


2580 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met His Lys Thr Ala Ser Gln Arg Leu Phe Pro Gly Pro Ser Tyr Gln
1 5 10 15

Asn Ile Lys Ser Ile Met Glu Asp Ser Thr Ile Leu Ser Asp Trp Thr
20 25 30

Asn Ser Asn Lys Gln Lys Met Lys Tyr Asp Phe Ser Cys Glu Leu Tyr
35 40 45

Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro Val Ser Glu
50 55 60

Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys
65 70 75 80

Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Leu Gly
85 90 95

Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser Cys Ser Phe
100 105 110

Ile Gln Asn Leu Val Ser Ala Ser Leu Gly Ser Thr Ser Lys Asn Thr
115 120 125

Ser Pro Met Arg Asn Ser Phe Ala His Ser Leu Ser Pro Thr Leu Glu
130 135 140

His Ser Ser Leu Phe Ser Gly Ser Tyr Ser Ser Leu Pro Pro Asn Pro
145 150 155 160

Leu Asn Ser Arg Ala Val Glu Asp Ile Ser Ser Ser Arg Thr Asn Pro
165 170 175

Tyr Ser Tyr Ala Met Ser Thr Glu Glu Ala Arg Phe Leu Thr Tyr His
180 185 190

Met Trp Pro Leu Thr Phe Leu Ser Pro Ser Glu Leu Ala Arg Ala Gly
195 200 205

Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys Gly
210 215 220

Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Asp Ala Met Ser Glu His
225 230 235 240

Arg Arg His Phe Pro Asn Cys Pro Phe Leu Glu Asn Ser Leu Glu Thr
245 250 255

Leu Arg Phe Ser Ile Ser Asn Leu Ser Met Gln Thr His Ala Ala Arg
260 265 270

Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro Val Gln Pro Glu
275 280 285

Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg Asn Asp Asp Val
290 295 300

Lys Cys Phe Gly Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser Gly Asp
305 310 315 320

Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu Phe Leu
325 330 335

Ile Arg Met Lys Gly Gln Glu Phe Val Asp Glu Ile Gln Gly Arg Tyr
340 345 350

Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Thr Thr Gly Glu
355 360 365

Glu Asn Ala Asp Pro Pro Ile Ile His Phe Gly Pro Gly Glu Ser Ser
370 375 380

Ser Glu Asp Ala Val Met Met Asn Thr Pro Val Val Lys Ser Ala Leu
385 390 395 400

Glu Met Gly Phe Asn Arg Asp Leu Val Lys Gln Thr Val Leu Ser Lys
405 410 415

Ile Leu Thr Thr Gly Glu Asn Tyr Lys Thr Val Asn Asp Ile Val Ser
420 425 430

Ala Leu Leu Asn Ala Glu Asp Glu Lys Arg Glu Glu Glu Lys Glu Lys
435 440 445

Gln Ala Glu Glu Met Ala Ser Asp Asp Leu Ser Leu Ile Arg Lys Asn
450 455 460

Arg Met Ala Leu Phe Gln Gln Leu Thr Cys Val Leu Pro Ile Leu Asp
465 470 475 480

Asn Leu Leu Lys Ala Asn Val Ile Asn Lys Gln Glu His Asp Ile Ile
485 490 495

Lys Gln Lys Thr Gln Ile Pro Leu Gln Ala Arg Glu Leu Ile Asp Thr
500 505 510

Ile Trp Val Lys Gly Asn Ala Ala Ala Asn Ile Phe Lys Asn Cys Leu
515 520 525

Lys Glu Ile Asp Ser Thr Leu Tyr Lys Asn Leu Phe Val Asp Lys Asn
530 535 540

Met Lys Tyr Ile Pro Thr Glu Asp Val Ser Gly Leu Ser Leu Glu Glu
545 550 555 560

Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met Asp
565 570 575

Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val Cys
580 585 590

Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg Gly Ile
595 600 605

Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
610 615



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GACACTCTGC TGGGCGGCGG GCCGCCCTCC TCCGGGACCT CCCCTCGGGA ACCGTCGCCC 60
GCGGCGCTTA GTTAGGACTG GAGTGCTTGG CGCGAAAAGG TGGACAAGTC CTATTTTCCA 120
GAGAAGATGA CTTTTAACAG TTTTGAAGGA ACTAGAACTT TTGTACTTGC AGACACCAAT 180
AAGGATGAAG AATTTGTAGA AGAGTTTAAT AGATTAAAAA CATTTGCTAA CTTCCCAAGT 240
AGTAGTCCTG TTTCAGCATC AACATTGGCG CGAGCTGGGT TTCTTTATAC CGGTGAAGGA 300
GACACCGTGC AATGTTTCAG TTGTCATGCG GCAATAGATA GATGGCAGTA TGGAGACTCA 360
GCTGTTGGAA GACACAGGAG AATATCCCCA AATTGCAGAT TTATCAATGG TTTTTATTTT 420
GAAAATGGTG CTGCACAGTC TACAAATCCT GGTATCCAAA ATGGCCAGTA CAAATCTGAA 480
AACTGTGTGG GAAATAGAAA TCCTTTTGCC CCTGACAGGC CACCTGAGAC TCATGCTGAT 540
TATCTCTTGA GAACTGGACA GGTTGTAGAT ATTTCAGACA CCATATACCC GAGGAACCCT 600
GCCATGTGTA GTGAAGAAGC CAGATTGAAG TCATTTCAGA ACTGGCCGGA CTATGCTCAT 660
TTAACCCCCA GAGAGTTAGC TAGTGCTGGC CTCTACTACA CAGGGGCTGA TGATCAAGTG 720
CAATGCTTTT GTTGTGGGGG AAAACTGAAA AATTGGGAAC CCTGTGATCG TGCCTGGTCA 780
GAACACAGGA GACACTTTCC CAATTGCTTT TTTGTTTTGG GCCGGAACGT TAATGTTCGA 840
AGTGAATCTG GTGTGAGTTC TGATAGGAAT TTCCCAAATT CAACAAACTC TCCAAGAAAT 900
CCAGCCATGG CAGAATATGA AGCACGGATC GTTACTTTTG GAACATGGAT ATACTCAGTT 960
AACAAGGAGC AGCTTGCAAG AGCTGGATTT TATGCTTTAG GTGAAGGCGA TAAAGTGAAG 1020
TGCTTCCACT GTGGAGGAGG GCTCACGGAT TGGAAGCCAA GTGAAGACCC CTGGGACCAG 1080
CATGCTAAGT GCTACCCAGG GTGCAAATAC CTATTGGATG AGAAGGGGCA AGAATATATA 1140
AATAATATTC ATTTAACCCA TCCACTTGAG GAATCTTTGG GAAGAACTGC TGAAAAAACA 1200
CCACCGCTAA CTAAAAAAAT CGATGATACC ATCTTCCAGA ATCCTATGGT GCAAGAAGCT 1260
ATACGAATGG GATTTAGCTT CAAGGACCTT AAGAAAACAA TGGAAGAAAA AATCCAAACA 1320
TCCGGGAGCA GCTATCTATC ACTTGAGGTC CTGATTGCAG ATCTTGTGAG TGCTCAGAAA 1380
GATAATACGG AGGATGAGTC AAGTCAAACT TCATTGCAGA AAGACATTAG TACTGAAGAG 1440
CAGCTAAGGC GCCTACAAGA GGAGAAGCTT TCCAAAATCT GTATGGATAG AAATATTGCT 1500
ATCGTTTTTT TTCCTTGTGG ACATCTGGCC ACTTGTAAAC AGTGTGCAGA AGCAGTTGAC 1560
AAATGTCCCA TGTGCTACAC CGTCATTACG TTCAACCAAA AAATTTTTAT GTCTTAGTGG 1620
GGCACCACAT GTTATGTTCT TCTTGCTCTA ATTGAATGTG TAATGGGAGC GAACTTTAAG 1680
TAATCCTGCA TTTGCATTCC ATTAGCATCC TGCTGTTTCC AAATGGAGAC CAATGCTAAC 1740
AGCACTGTTT CCGTCTAAAC ATTCAATTTC TGGATCTTTC GAGTTATCAG CTGTATCATT 1800
TAGCCAGTGT TTTACTCGAT TGAAACCTTA GACAGAGAAG CATTTTATAG CTTTTCACAT 1860
GTATATTGGT AGTACACTGA CTTGATTTCT ATATGTAAGT GAATTCATCA CCTGCATGTT 1920
TCATGCCTTT TGCATAAGCT TAACAAATGG AGTGTTCTGT ATAAGCATGG AGATGTGATG 1980
GAATCTGCCC AATGACTTTA ATTGGCTTAT TGTAAACACG GAAAGAACTG CCCCACGCTG 2040
CTGGGAGGAT AAAGATTGTT TTAGATGCTC ACTTCTGTGT TTTAGGATTC TGCCCATTTA 2100



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Thr Phe Asn Ser Phe Glu Gly Thr Arg Thr Phe Val Leu Ala Asp
1 5 10 15

Thr Asn Lys Asp Glu Glu Phe Val Glu Glu Phe Asn Arg Leu Lys Thr
20 25 30

Phe Ala Asn Phe Pro Ser Ser Ser Pro Val Ser Ala Ser Thr Leu Ala
35 40 45

Arg Ala Gly Phe Leu Tyr Thr Gly Glu Gly Asp Thr Val Gln Cys Phe
50 55 60

Ser Cys His Ala Ala Ile Asp Arg Trp Gln Tyr Gly Asp Ser Ala Val
65 70 75 80

Gly Arg His Arg Arg Ile Ser Pro Asn Cys Arg Phe Ile Asn Gly Phe
85 90 95

Tyr Phe Glu Asn Gly Ala Ala Gln Ser Thr Asn Pro Gly Ile Gln Asn
100 105 110

Gly Gln Tyr Lys Ser Glu Asn Cys Val Gly Asn Arg Asn Pro Phe Ala
115 120 125

Pro Asp Arg Pro Pro Glu Thr His Ala Asp Tyr Leu Leu Arg Thr Gly
130 135 140

Gln Val Val Asp Ile Ser Asp Thr Ile Tyr Pro Arg Asn Pro Ala Met
145 150 155 160

Cys Ser Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr
165 170 175

Ala His Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr
180 185 190

Gly Ala Asp Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys
195 200 205

Asn Trp Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe
210 215 220

Pro Asn Cys Phe Phe Val Leu Gly Arg Asn Val Asn Val Arg Ser Glu
225 230 235 240

Ser Gly Val Ser Ser Asp Arg Asn Phe Pro Asn Ser Thr Asn Ser Pro
245 250 255

Arg Asn Pro Ala Met Ala Glu Tyr Glu Ala Arg Ile Val Thr Phe Gly
260 265 270

Thr Trp Ile Tyr Ser Val Asn Lys Glu Gln Leu Ala Arg Ala Gly Phe
275 280 285

Tyr Ala Leu Gly Glu Gly Asp Lys Val Lys Cys Phe His Cys Gly Gly
290 295 300

Gly Leu Thr Asp Trp Lys Pro Ser Glu Asp Pro Trp Asp Gln His Ala
305 310 315 320

Lys Cys Tyr Pro Gly Cys Lys Tyr Leu Leu Asp Glu Lys Gly Gln Glu
325 330 335

Tyr Ile Asn Asn Ile His Leu Thr His Pro Leu Glu Glu Ser Leu Gly
340 345 350

Arg Thr Ala Glu Lys Thr Pro Pro Leu Thr Lys Lys Ile Asp Asp Thr
355 360 365

Ile Phe Gln Asn Pro Met Val Gln Glu Ala Ile Arg Met Gly Phe Ser
370 375 380

Phe Lys Asp Leu Lys Lys Thr Met Glu Glu Lys Ile Gln Thr Ser Gly
385 390 395 400

Ser Ser Tyr Leu Ser Leu Glu Val Leu Ile Ala Asp Leu Val Ser Ala
405 410 415

Gln Lys Asp Asn Thr Glu Asp Glu Ser Ser Gln Thr Ser Leu Gln Lys
420 425 430

Asp Ile Ser Thr Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu
435 440 445

Ser Lys Ile Cys Met Asp Arg Asn Ile Ala Ile Val Phe Phe Pro Cys
450 455 460

Gly His Leu Ala Thr Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys
465 470 475 480

Pro Met Cys Tyr Thr Val Ile Thr Phe Asn Gln Lys Ile Phe Met Ser
485 490 495

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 67 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Lys Ala Ala Arg Leu Gly Thr Tyr Thr Asn Trp Pro Val Gln Phe Leu
1 5 10 15

Glu Pro Ser Arg Met Ala Ala Ser Gly Phe Tyr Tyr Leu Gly Arg Gly
20 25 30

Asp Glu Val Arg Cys Ala Phe Cys Lys Val Glu Ile Thr Asn Trp Val
35 40 45

Arg Gly Asp Asp Pro Glu Thr Asp His Lys Arg Trp Ala Pro Gln Cys
50 55 60

Pro Phe Val 
65 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Asp Leu Arg Leu Glu Glu Val Arg Leu Asn Thr Phe Glu Lys
1 5 10 15

Trp Pro Val Ser Phe Leu Ser Pro Glu Thr Met Ala Lys Asn Gly Phe
20 25 30

Tyr Tyr Leu Gly Arg Ser Asp Glu Val Arg Cys Ala Phe Cys Lys Val
35 40 45

Glu Ile Met Arg Trp Lys Glu Gly Glu Asp Pro Ala Ala Asp His Lys
50 55 60

Lys Trp Ala Pro Gln Cys Pro Phe Val Lys Gly Ile Asp Val Cys Gly
65 70 75 80

Ser Ile Val Thr Thr Asn Asn Ile Gln Asn Thr Thr Thr His Asp Thr
85 90 95

Ile Ile Gly Pro Ala His Pro Lys Tyr Ala His Glu Ala Ala Arg Val
100 105 110

Lys Ser Phe His Asn Trp Pro Arg Cys Met Lys Gln Arg Pro Glu Gln
115 120 125

Met Ala Asp Ala Gly Phe Phe Tyr Thr Gly Tyr Gly Asp Asn Thr Lys
130 135 140

Cys Phe Tyr Cys Asp Gly Gly Leu Lys Asp Trp Glu Pro Glu Asp Val
145 150 155 160

Pro Trp Glu Gln His Val Arg Trp Phe Asp Arg Cys Ala Tyr Val Gln
165 170 175

Leu Val Lys Gly Arg Asp Tyr Val Gln Lys Val Ile Thr Glu Ala Cys
180 185 190

Val Leu Pro Gly Glu Asn Thr Thr Val Ser Thr Ala Ala Pro Val Ser
195 200 205

Glu Pro Ile Pro Glu Thr Lys Ile Glu Lys Glu Pro Gln Val Glu Asp
210 215 220

Ser Lys Leu Cys Lys Ile Cys Tyr Val Glu Glu Cys Ile Val Cys Phe
225 230 235 240

Val Pro Cys Gly His Val Val Ala Cys Ala Lys Cys Ala Leu Ser Val
245 250 255

Asp Lys Cys Pro Met Cys Arg Lys Ile Val Thr Ser Val Leu Lys Val
260 265 270

Tyr Phe Ser 
275 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Thr Glu Leu Gly Met Glu Leu Glu Ser Val Arg Leu Ala Thr Phe
1 5 10 15

Gly Glu Trp Pro Leu Asn Ala Pro Val Ser Ala Glu Asp Leu Val Ala
20 25 30

Asn Gly Phe Phe Ala Thr Gly Lys Trp Leu Glu Ala Glu Cys His Phe
35 40 45

Cys His Val Arg Ile Asp Arg Trp Glu Tyr Gly Asp Gln Val Ala Glu
50 55 60

Arg His Arg Arg Ser Ser Pro Ile Cys Ser Met Val Leu Ala Pro Asn
65 70 75 80

His Cys Gly Asn Val Pro Arg Ser Gln Glu Ser Asp Asn Glu Gly Asn
85 90 95

Ser Val Val Asp Ser Pro Glu Ser Cys Ser Cys Pro Asp Leu Leu Leu
100 105 110

Glu Ala Asn Arg Leu Val Thr Phe Lys Asp Trp Pro Asn Pro Asn Ile
115 120 125

Thr Pro Gln Ala Leu Ala Lys Ala Gly Phe Tyr Tyr Leu Asn Arg Leu
130 135 140

Asp His Val Lys Cys Val Trp Cys Asn Gly Val Ile Ala Lys Trp Glu
145 150 155 160

Lys Asn Asp Asn Ala Phe Glu Glu His Lys Arg Phe Phe Pro Gln Cys
165 170 175

Pro Arg Val Gln Met Gly Pro Leu Ile Glu Phe Ala Thr Gly Lys Asn
180 185 190

Leu Asp Glu Leu Gly Ile Gln Pro Thr Thr Leu Pro Leu Arg Pro Lys
195 200 205

Tyr Ala Cys Val Asp Ala Arg Leu Arg Thr Phe Thr Asp Trp Pro Ile
210 215 220

Ser Asn Ile Gln Pro Ala Ser Ala Leu Ala Gln Ala Gly Leu Tyr Tyr
225 230 235 240

Gln Lys Ile Gly Asp Gln Val Arg Cys Phe His Cys Asn Ile Gly Leu
245 250 255

Arg Ser Trp Gln Lys Glu Asp Glu Pro Trp Phe Glu His Ala Lys Trp
260 265 270

Ser Pro Lys Cys Gln Phe Val Leu Leu Ala Lys Gly Pro Ala Tyr Val
275 280 285

Ser Glu Val Leu Ala Thr Thr Ala Ala Asn Ala Ser Ser Gln Pro Ala
290 295 300

Thr Ala Pro Ala Pro Thr Leu Gln Ala Asp Val Leu Met Asp Glu Ala
305 310 315 320

Pro Ala Lys Glu Ala Leu Thr Leu Gly Ile Asp Gly Gly Val Val Arg
325 330 335

Asn Ala Ile Gln Arg Lys Leu Leu Ser Ser Gly Cys Ala Phe Ser Thr
340 345 350

Leu Asp Glu Leu Leu His Asp Ile Phe Asp Asp Ala Gly Ala Gly Ala
355 360 365

Ala Leu Glu Val Arg Glu Pro Pro Glu Pro Ser Ala Pro Phe Ile Glu
370 375 380

Pro Cys Gln Ala Thr Thr Ser Lys Ala Ala Ser Val Pro Ile Pro Val
385 390 395 400

Ala Asp Ser Ile Pro Ala Lys Pro Gln Ala Ala Glu Ala Val Ser Asn
405 410 415

Ile Ser Lys Ile Thr Asp Glu Ile Gln Lys Met Ser Val Ser Thr Pro
420 425 430

Asn Gly Asn Leu Ser Leu Glu Glu Glu Asn Arg Gln Leu Lys Asp Ala
435 440 445

Arg Leu Cys Lys Val Cys Leu Asp Glu Glu Val Gly Val Val Phe Leu
450 455 460

Pro Cys Gly His Leu Ala Thr Cys Asn Gln Cys Ala Pro Ser Val Ala
465 470 475 480

Asn Cys Pro Met Cys Arg Ala Asp Ile Lys Gly Phe Val Arg Thr Phe
485 490 495

Leu Ser



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Glu Val Arg Leu Asn Thr Phe Glu Lys Trp Pro Val Ser Phe Leu
1 5 10 15

Ser Pro Glu Thr Met Ala Lys Asn Gly Phe Tyr Tyr Leu Gly Arg Ser
20 25 30

Asp Glu Val Arg Cys Ala Phe Cys Lys Val Glu Ile Met Arg Trp Lys
35 40 45

Glu Gly Glu Asp Pro Ala Ala Asp His Lys Lys Trp Ala Pro Gln Cys
50 55 60

Pro Phe Val 
65 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Glu Ala Asn Arg Leu Val Thr Phe Lys Asp Trp Pro Asn Pro Asn Ile
1 5 10 15

Thr Pro Gln Ala Leu Ala Lys Ala Gly Phe Tyr Tyr Leu Asn Arg Leu
20 25 30

Asp His Val Lys Cys Val Trp Cys Asn Gly Val Ile Ala Lys Trp Glu
35 40 45

Lys Asn Asp Asn Ala Phe Glu Glu His Lys Arg Phe Phe Pro Gln Cys
50 55 60 

Pro Arg Val 
65 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Ser Ser Pro
1 5 10 15

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu
20 25 30

Gly Asp Thr Val Gln Cys Phe Ser Cys His Ala Ala Ile Asp Arg Trp
35 40 45

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Arg Ile Ser Pro Asn
50 55 60

Cys Arg Phe Ile
65 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Phe Asn Arg Leu Lys Thr Phe Ala Asn Phe Pro Ser Gly Ser Pro
1 5 10 15

Val Ser Ala Ser Thr Leu Ala Arg Ala Gly Phe Leu Tyr Thr Gly Glu
20 25 30

Gly Asp Thr Val Arg Cys Phe Ser Cys His Ala Ala Val Asp Arg Trp
35 40 45

Gln Tyr Gly Asp Ser Ala Val Gly Arg His Arg Lys Val Ser Pro Asn
50 55 60

Cys Arg Phe Ile
65



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro 
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val 
20 25 30 

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp 
35 40 45 

Lys Arg Gly Asp Ser Pro Thr Glu Lys His Lys Lys Leu Tyr Pro Ser 
50 55 60 

Cys Arg Phe Val 
65 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Glu Leu Tyr Arg Met Ser Thr Tyr Ser Thr Phe Pro Ala Gly Val Pro
1 5 10 15

Val Ser Glu Arg Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Val
20 25 30

Asn Asp Lys Val Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp
35 40 45

Lys Leu Gly Asp Ser Pro Ile Gln Lys His Lys Gln Leu Tyr Pro Ser
50 55 60

Cys Ser Phe Ile
65



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His
1 5 10 15

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ala
20 25 30

Asp Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp
35 40 45



Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn 
50 55 60 



Cys Phe Phe Val 
65 



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Glu Glu Ala Arg Leu Lys Ser Phe Gln Asn Trp Pro Asp Tyr Ala His
1 5 10 15

Leu Thr Pro Arg Glu Leu Ala Ser Ala Gly Leu Tyr Tyr Thr Gly Ile
20 25 30

Gly Asp Gln Val Gln Cys Phe Cys Cys Gly Gly Lys Leu Lys Asn Trp
35 40 45 

Glu Pro Cys Asp Arg Ala Trp Ser Glu His Arg Arg His Phe Pro Asn 
50 55 60 



Cys Phe Phe Val 
65 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Glu Asn Ala Arg Leu Leu Thr Phe Gln Thr Trp Pro Leu Thr Phe Leu
1 5 10 15

Ser Pro Thr Asp Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu
35 40 45

Pro Lys Asp Asn Ala Met Ser Glu His Leu Arg His Phe Pro Lys Cys
50 55 60

Pro Phe Ile
65 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Glu Glu Ala Arg Phe Leu Thr Tyr His Met Trp Pro Leu Thr Phe Leu 
1 5 10 15

Ser Pro Ser Glu Leu Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly
20 25 30 

Asp Arg Val Ala Cys Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu 
35 40 45 

Pro Lys Asp Asp Ala Met Ser Glu His Arg Arg His Phe Pro Asn Cys 
50 55 60 

Pro Phe Leu 
65

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Glu Ala Arg Ile Val Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn
1 5 10 15

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp
20 25 30

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro
35 40 45

Ser Glu Asp Pro Trp Asp Gln His Ala Lys Cys Tyr Pro Gly Cys Lys
50 55 60 

Tyr Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Tyr Glu Ala Arg Ile Phe Thr Phe Gly Thr Trp Ile Tyr Ser Val Asn
1 5 10 15

Lys Glu Gln Leu Ala Arg Ala Gly Phe Tyr Ala Leu Gly Glu Gly Asp
20 25 30

Lys Val Lys Cys Phe His Cys Gly Gly Gly Leu Thr Asp Trp Lys Pro
35 40 45

Ser Glu Asp Pro Trp Glu Gln His Ala Lys Trp Tyr Pro Gly Cys Lys
50 55 60 



Tyr Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

His Ala Ala Arg Phe Lys Thr Phe Phe Asn Trp Pro Ser Ser Val Leu
1 5 10 15

Val Asn Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Asn
20 25 30

Ser Asp Asp Val Lys Cys Phe Cys Cys Asp Gly Gly Leu Arg Cys Trp
35 40 45

Glu Ser Gly Asp Asp Pro Trp Val Gln His Ala Lys Trp Phe Pro Arg
50 55 60 

Cys Glu Tyr Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

His Ala Ala Arg Met Arg Thr Phe Met Tyr Trp Pro Ser Ser Val Pro 
1 5 10 15

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Gly Arg
20 25 30 

Asn Asp Asp Val Lys Cys Phe Gly Cys Asp Gly Gly Leu Arg Cys Trp 
35 40 45 

Glu Ser Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg 

50 55 60 



Cys Glu Phe Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Glu Ala Ala Arg Leu Arg Thr Phe Ala Glu Trp Pro Arg Gly Leu Lys
1 5 10 15

Gln Arg Pro Glu Glu Leu Ala Glu Ala Gly Phe Phe Tyr Thr Gly Gln
20 25 30

Gly Asp Lys Thr Arg Cys Phe Cys Cys Asp Gly Gly Leu Lys Asp Trp
35 40 45

Glu Pro Asp Asp Ala Pro Trp Gln Gln His Ala Arg Trp Tyr Asp Arg
50 55 60 

Cys Glu Tyr Val 
65 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Glu Ala Ala Arg Val Lys Ser Phe His Asn Trp Pro Arg Cys Met Lys
1 5 10 15

Gln Arg Pro Glu Gln Met Ala Asp Ala Gly Phe Phe Tyr Thr Gly Tyr
20 25 30

Gly Asp Asn Thr Lys Cys Phe Tyr Cys Asp Gly Gly Leu Lys Asp Trp
35 40 45

Glu Pro Glu Asp Val Pro Trp Glu Gln His Val Arg Trp Phe Asp Arg
50 55 60

Cys Ala Tyr Val
65



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Val Asp Ala Arg Leu Arg Thr Phe Thr Asp Trp Pro Ile Ser Asn Ile
1 5 10 15

Gln Pro Ala Ser Ala Leu Ala Gln Ala Gly Leu Tyr Tyr Gln Lys Ile
20 25 30

Gly Asp Gln Val Arg Cys Phe His Cys Asn Ile Gly Leu Arg Ser Trp
35 40 45

Gln Lys Glu Asp Glu Pro Trp Phe Glu His Ala Lys Trp Ser Pro Lys
50 55 60

Cys Gln Phe Val
65 

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Glu Ser Val Arg Leu Ala Thr Phe Gly Glu Trp Pro Leu Asn Ala Pro
1 5 10 15

Val Ser Ala Glu Asp Leu Val Ala Asn Gly Phe Phe Gly Thr Trp Met
20 25 30

Glu Ala Glu Cys Asp Phe Cys His Val Arg Ile Asp Arg Trp Glu Tyr
35 40 45

Gly Asp Leu Val Ala Glu Arg His Arg Arg Ser Ser Pro Ile Cys Ser
50 55 60 

Met Val 
65 

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys Lys Val Cys Met
1 5 10 15

Asp Lys Glu Val Ser Val Val Phe Ile Pro Cys Gly His Leu Val Val
20 25 30

Cys Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Glu Gln Leu Arg Arg Leu Pro Glu Glu Arg Thr Cys Lys Val Cys Met
1 5 10 15

Asp Lys Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val
20 25 30

Cys Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Ser Lys Ile Cys Met
1 5 10 15

Asp Arg Asn Ile Ala Ile Val Phe Phe Pro Cys Gly His Leu Ala Thr
20 25 30

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys
35 40 45

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Glu Gln Leu Arg Arg Leu Gln Glu Glu Lys Leu Cys Lys Ile Cys Met
1 5 10 15

Asp Arg Asn Ile Ala Ile Val Phe Val Pro Cys Gly His Leu Val Thr
20 25 30

Cys Lys Gln Cys Ala Glu Ala Val Asp Lys Cys Pro Met Cys
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Glu Glu Asn Arg Gln Leu Lys Asp Ala Arg Leu Cys Lys Val Cys Leu
1 5 10 15

Asp Glu Glu Val Gly Val Val Phe Leu Pro Cys Gly His Leu Ala Thr
20 25 30

Cys Asn Gln Cys Ala Pro Ser Val Ala Asn Cys Pro Met Cys
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Glu Lys Glu Pro Gln Val Glu Asp Ser Lys Leu Cys Lys Ile Cys Tyr
1 5 10 15

Val Glu Glu Cys Ile Val Cys Phe Val Pro Cys Gly His Val Val Ala
20 25 30 

Cys Ala Lys Cys Ala Leu Ser Val Asp Lys Cys Pro Met Cys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Val Glu Ala Glu Val Ala Asp Asp Arg Leu Cys Lys Ile Cys Leu
1 5 10 15

Gly Ala Glu Lys Thr Val Cys Phe Val Pro Cys Gly His Val Val Ala 
20 25 30 

Cys Gly Lys Cys Ala Ala Gly Val Thr Thr Cys Pro Val Cys 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GAATTCCGGG AGACCTACAC CCCCGGAGAT CAGAGGTCAT TGCTGGCGTT CAGAGCCTAG 60 

GAAGTGGGCT GCGGTATCAG CCTAGCAGTA AAACCGACCA GAAGCCATGC ACAAAACTAC 120 

ATCCCCAGAG AAAGACTTGT CCCTTCCCCT CCCTGTCATC TCACCATGAA CATGGTTCAA 180 

GACAGCGCCT TTCTAGCCAA GCTGATGAAG AGTGCTGACA CCTTTGAGTT GAAGTATGAC 240 

TTTTCCTGTG AGCTGTACCG ATTGTCCACG TATTCAGCTT TTCCCAGGGG AGTTCCTGTG 300 

TCAGAAAGGA GTCTGGCTCG TGCTGGCTTT TACTACACTG GTGCCAATGA CAAGGTCAAG 360

TGCTTCTGCT GTGGCCTGAT GCTAGACAAC TGGAAACAAG GGGACAGTCC CATGGAGAAG 420

CACAGAAAGT TGTACCCCAG CTGCAACTTT GTACAGACTT TGAATCCAGC CAACAGTCTG 480

GAAGCTAGTC CTCGGCCTTC TCTTCCTTCC ACGGCGATGA GCACCATGCC TTTGAGCTTT 540

GCAAGTTCTG AGAATACTGG CTATTTCAGT GGCTCTTACT CGAGCTTTCC CTCAGACCCT 600

GTGAACTTCC GAGCAAATCA AGATTGTCCT GCTTTGAGCA CAAGTCCCTA CCACTTTGCA 660

ATGAACACAG AGAAGGCCAG ATTACTCACC TATGAAACAT GGCCATTGTC TTTTCTGTCA 720

CCAGCAAAGC TGGCCAAAGC AGGCTTCTAC TACATAGGAC CTGGAGATAG AGTGGCCTGC 780

TTTGCGTGCG ATGGGAAACT GAGCAACTGG GAACGTAAGG ATGATGCTAT GTCAGAGCAC 840

CAGAGGCATT TCCCCAGCTG TCCGTTCTTA AAAGACTTGG GTCAGTCTGC TTCGAGATAC 900

ACTGTCTCTA ACCTGAGCAT GCAGACACAC GCAGCCCGTA TTAGAACATT CTCTAACTGG 960

CCTTCTAGTG CACTAGTTCA TTCCCAGGAA CTTGCAAGTG CGGGCTTTTA TTATACAGGA 1020

CACAGTGATG ATGTCAAGTG TTTATGCTGT GATGGTGGGC TGAGGTGCTG GGAATCTGGA 1080

GATGACCCCT GGGTGGAACA TGCCAAGTGG TTTCCAAGGT GTGAGTACTT GCTCAGAATC 1140

AAAGGCCAAG AATTTGTCAG CCAAGTTCAA GCTGGCTATC CTCATCTACT TGAGCAGCTA 1200

TTATCTACGT CAGACTCCCC AGAAGATGAG AATGCAGACG CAGCAATCGT GCATTTTGGC 1260

CCTGGAGAAA GTTCGGAAGA TGTCGTCATG ATGAGCACGC CTGTGGTTAA AGCAGCCTTG 1320

GAAATGGGCT TCAGTAGGAG CCTGGTGAGA CAGACGGTTC AGTGGCAGAT CCTGGCCACT 1380

GGTGAGAACT ACAGGACCGT CAGTGACCTC GTTATAGGCT TACTCGATGC AGAAGACGAG 1440

ATGAGAGAGG AGCAGATGGA GCAGGCGGCC GAGGAGGAGG AGTCAGATGA TCTAGCACTA 1500

ATCCGGAAGA ACAAAATGGT GCTTTTCCAA CATTTGACGT GTGTGACACC AATGCTGTAT 1560

TGCCTCCTAA GTGCAAGGGC CATCACTGAA CAGGAGTGCA ATGCTGTGAA ACAGAAACCA 1620

CACACCTTAC AAGCAAGCAC ACTGATTGAT ACTGTGTTAG CAAAAGGAAA CACTGCAGCA 1680

ACCTCATTCA GAAACTCCCT TCGGGAAATT GACCCTGCGT TATACAGAGA TATATTTGTG 1740

CAACAGGACA TTAGGAGTCT TCCCACAGAT GACATTGCAG CTCTACCAAT GGAAGAACAG 1800

TTGCGGCCCC TCCCGGAGGA CAGAATGTGT AAAGTGTGTA TGGACCGAGA GGTATCCATC 1860

GTGTTCATTC CCTGTGGCCA TCTGGTCGTG TGCAAAGACT GCGCTCCCTC TCTGAGGAAG 1920

TGTCCCATCT GTAGAGGGAC CATCAAGGGC ACAGTGCGCA CATTTCTCTC CTGAACAAGA 1980

CTAATGGTCC ATGGCTGCAA CTTCAGCCAG GAGGAAGTTC ACTGTCACTC CCAGTTCCAT 2040


TCGGAACTTG 


AGGCCAGCCT 


GGATAGCACG 


AGACACCGCC 


AAACACACAA 


ATATAAACAT 


2100 


GAAAAACTTT 


TGTCTGAAGT 


CAAGAATGAA 


TGAATTACTT 


ATATAATAAT 


TTTAATTGGT 


2160 


TTCCTTAAAA 


GTGCTATTTG 


TTCCCAACTC 


AGAAAATTGT 


TTTCTGTAAA 


CATATTTACA 


2220 



82 



# • 



TACTACCTGC ATCTAAAGTA TTCATATATT CATATATTCA GATGTCATGA GAGAGGGTTT 2280 

TGTTCTTGTT CCTGAAAAGC TGGTTTATCA TCTGATCAGC ATATACTGCG CAACGGGCAG 2340 

GGCTAGAATC CATGAACCAA GCTGCAAAGA TCTCACGCTA AATAAGGCGG AAAGATTTGG 2400 

AGAAACGAAA GGAAATTCTT TCCTGTCCAA TGTATACTCT TCAGACTAAT GACCTCTTCC 2460 

TATCAAGCCT TCTA 2474

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Met Asn Met Val Gln Asp Ser Ala Phe Leu Ala Lys Leu Met Lys Ser
1 5 10 15

Ala Asp Thr Phe Glu Leu Lys Tyr Asp Phe Ser Cys Glu Leu Tyr Arg
20 25 30

Leu Ser Thr Tyr Ser Ala Phe Pro Arg Gly Val Pro Val Ser Glu Arg
35 40 45

Ser Leu Ala Arg Ala Gly Phe Tyr Tyr Thr Gly Ala Asn Asp Lys Val
50 55 60

Lys Cys Phe Cys Cys Gly Leu Met Leu Asp Asn Trp Lys Gln Gly Asp
65 70 75 80

Ser Pro Met Glu Lys His Arg Lys Leu Tyr Pro Ser Cys Asn Phe Val
85 90 95

Gln Thr Leu Asn Pro Ala Asn Ser Leu Glu Ala Ser Pro Arg Pro Ser
100 105 110

Leu Pro Ser Thr Ala Met Ser Thr Met Pro Leu Ser Phe Ala Ser Ser
115 120 125

Glu Asn Thr Gly Tyr Phe Ser Gly Ser Tyr Ser Ser Phe Pro Ser Asp
130 135 140

Pro Val Asn Phe Arg Ala Asn Gln Asp Cys Pro Ala Leu Ser Thr Ser
145 150 155 160

Pro Tyr His Phe Ala Met Asn Thr Glu Lys Ala Arg Leu Leu Thr Tyr
165 170 175

Glu Thr Trp Pro Leu Ser Phe Leu Ser Pro Ala Lys Leu Ala Lys Ala
180 185 190

Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys Phe Ala Cys
195 200 205

Asp Gly Lys Leu Ser Asn Trp Glu Arg Lys Asp Asp Ala Met Ser Glu
210 215 220

His Gln Arg His Phe Pro Ser Cys Pro Phe Leu Lys Asp Leu Gly Gln
225 230 235 240

Ser Ala Ser Arg Tyr Thr Val Ser Asn Leu Ser Met Gln Thr His Ala
245 250 255

Ala Arg Ile Arg Thr Phe Ser Asn Trp Pro Ser Ser Ala Leu Val His
260 265 270

Ser Gln Glu Leu Ala Ser Ala Gly Phe Tyr Tyr Thr Gly His Ser Asp
275 280 285

Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp Glu Ser
290 295 300

Gly Asp Asp Pro Trp Val Glu His Ala Lys Trp Phe Pro Arg Cys Glu
305 310 315 320

Tyr Leu Leu Arg Ile Lys Gly Gln Glu Phe Val Ser Gln Val Gln Ala
325 330 335

Gly Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp Ser Pro
340 345 350

Glu Asp Glu Asn Ala Asp Ala Ala Ile Val His Phe Gly Pro Gly Glu
355 360 365

Ser Ser Glu Asp Val Val Met Met Ser Thr Pro Val Val Lys Ala Ala
370 375 380

Leu Glu Met Gly Phe Ser Arg Ser Leu Val Arg Gln Thr Val Gln Trp
385 390 395 400

Gln Ile Leu Ala Thr Gly Glu Asn Tyr Arg Thr Val Ser Asp Leu Val
405 410 415

Ile Gly Leu Leu Asp Ala Glu Asp Glu Met Arg Glu Glu Gln Met Glu
420 425 430

Gln Ala Ala Glu Glu Glu Glu Ser Asp Asp Leu Ala Leu Ile Arg Lys
435 440 445

Asn Lys Met Val Leu Phe Gln His Leu Thr Cys Val Thr Pro Met Leu
450 455 460

Tyr Cys Leu Leu Ser Ala Arg Ala Ile Thr Glu Gln Glu Cys Asn Ala
465 470 475 480

Val Lys Gln Lys Pro His Thr Leu Gln Ala Ser Thr Leu Ile Asp Thr
485 490 495

Val Leu Ala Lys Gly Asn Thr Ala Ala Thr Ser Phe Arg Asn Ser Leu
500 505 510

Arg Glu Ile Asp Pro Ala Leu Tyr Arg Asp Ile Phe Val Gln Gln Asp
515 520 525

Ile Arg Ser Leu Pro Thr Asp Asp Ile Ala Ala Leu Pro Met Glu Glu
530 535 540

Gln Leu Arg Pro Leu Pro Glu Asp Arg Met Cys Lys Val Cys Met Asp
545 550 555 560

Arg Glu Val Ser Ile Val Phe Ile Pro Cys Gly His Leu Val Val Cys
565 570 575

Lys Asp Cys Ala Pro Ser Leu Arg Lys Cys Pro Ile Cys Arg Gly Thr
580 585 590

Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
595 600

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2416 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTGTGGTGGA GATCTATTGT CCAAGTGGTG AGAAACTTCA TCTGGAAGTT TAAGCGGTCA 60 

GAAATACTAT TACTACTCAT GGACAAAACT GTCTCCCAGA GACTCGCCCA AGGTACCTTA 120 

CACCCAAAAA CTTAAACGTA TAATGGAGAA GAGCACAATC TTGTCAAATT GGACAAAGGA 180 

GAGCGAAGAA AAAATGAAGT TTGACTTTTC GTGTGAACTC TACCGAATGT CTACATATTC 240 

AGCTTTTCCC AGGGGAGTTC CTGTCTCAGA GAGGAGTCTG GCTCGTGCTG GCTTTTATTA 300 

TACAGGTGTG AATGACAAAG TCAAGTGCTT CTGCTGTGGC CTGATGTTGG ATAACTGGAA 360 

ACAAGGGGAC AGTCCTGTTG AAAAGCACAG ACAGTTCTAT CCCAGCTGCA GCTTTGTACA 420 

GACTCTGCTT TCAGCCAGTC TGCAGTCTCC ATCTAAGAAT ATGTCTCCTG TGAAAAGTAG 480 

ATTTGCACAT TCGTCACCTC TGGAACGAGG TGGCATTCAC TCCAACCTGT GCTCTAGCCC 540

TCTTAATTCT AGAGCAGTGG AAGACTTCTC ATCAAGGATG GATCCCTGCA GCTATGCCAT 600

GAGTACAGAA GAGGCCAGAT TTCTTACTTA CAGTATGTGG CCTTTAAGTT TTCTGTCACC 660

AGCAGAGCTG GCCAGAGCTG GCTTCTATTA CATAGGGCCT GGAGACAGGG TGGCCTGTTT 720

TGCCTGTGGT GGGAAACTGA GCAACTGGGA ACCAAAGGAT TATGCTATGT CAGAGCACCG 780

CAGACATTTT CCCCACTGTC CATTTCTGGA AAATACTTCA GAAACACAGA GGTTTAGTAT 840

ATCAAATCTA AGTATGCAGA CACACTCTGC TCGATTGAGG ACATTTCTGT ACTGGCCACC 900

TAGTGTTCCT GTTCAGCCCG AGCAGCTTGC AAGTGCTGGA TTCTATTACG TGGATCGCAA 960

TGATGATGTC AAGTGCCTTT GTTGTGATGG TGGCTTGAGA TGTTGGGAAC CTGGAGATGA 1020

CCCCTGGATA GAACACGCCA AATGGTTTCC AAGGTGTGAG TTCTTGATAC GGATGAAGGG 1080

TCAGGAGTTT GTTGATGAGA TTCAAGCTAG ATATCCTCAT CTTCTTGAGC AGCTGTTGTC 1140

CACTTCAGAC ACCCCAGGAG AAGAAAATGC TGACCCTACA GAGACAGTGG TGCATTTTGG 1200

CCCTGGAGAA AGTTCGAAAG ATGTCGTCAT GATGAGCACG CCTGTGGTTA AAGCAGCCTT 1260

GGAAATGGGC TTCAGTAGGA GCCTGGTGAG ACAGACGGTT CAGCGGCAGA TCCTGGCCAC 1320

TGGTGAGAAC TACAGGACCG TCAATGATAT TGTCTCAGTA CTTTTGAATG CTGAAGATGA 1380

GAGAAGAGAA GAGGAGAAGG AAAGACAGAC TGAAGAGATG GCATCAGGTG ACTTATCACT 1440

GATTCGGAAG AATAGAATGG CCCTCTTTCA ACAGTTGACA CATGTCCTTC CTATCCTGGA 1500

TAATCTTCTT GAGGCCAGTG TAATTACAAA ACAGGAACAT GATATTATTA GACAGAAAAC 1560

ACAGATACCC TTACAAGCAA GAGAGCTTAT TGACACCGTT TTAGTCAAGG GAAATGCTGC 1620

AGCCAACATC TTCAAAAACT CTCTGAAGGG AATTGACTCC ACGTTATATG AAAACTTATT 1680

TGTGGAAAAG AATATGAAGT ATATTCCAAC AGAAGACGTT TCAGGCTTGT CATTGGAAGA 1740

GCAGTTGCGG AGATTACAAG AAGAACGAAC TTGCAAAGTG TGTATGGACA GAGAGGTTTC 1800

TATTGTGTTC ATTCCGTGTG GTCATCTAGT AGTCTGCCAG GAATGTGCCC CTTCTCTAAG 1860

GAAGTGCCCC ATCTGCAGGG GGACAATCAA GGGGACTGTG CGCACATTTC TCTCATGAGT 1920

GAAGAATGGT CTGAAAGTAT TGTTGGACAT CAGAAGCTGT CAGAACAAAG AATGAACTAC 1980

TGATTTCAGC TCTTCAGCAG GACATTCTAC TCTCTTTCAA GATTAGTAAT CTTGCTTTAT 2040

GAAGGGTAGC ATTGTATATT TAAGCTTAGT CTGTTGCAAG GGAAGGTCTA TGCTGTTGAG 2100

CTACAGGACT GTGTCTGTTC CAGAGCAGGA GTTGGGATGC TTGCTGTATG TCCTTCAGGA 2160

CTTCTTGGGA TTTGGGAATT TGGGGAAAGC TTTGGAATCC AGTGATGTGG AGCTCAGAAA 2220

TCCTGGAACC AGTGACTCTG GTACTCAGTA GATAGGGTAC CCTGTACTTC TTGGTGCTTT 2280

TCCAGTCTGG GAAATAAGGA GGAATCTGCT GCTGGTAAAA ATTTGCTGGA TGTGAGAAAT 2340

AGATGAAAGT GTTTCGGGTG GGGGCGTGCA TCAGTGTAGT GTGTGCAGGG ATGTATGCAG 2400

GCCAAACACT GTGTAG 2416

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Met Glu Lys Ser Thr Ile Leu Ser Asn Trp Thr Lys Glu Ser Glu Glu
1 5 10 15

Lys Met Lys Phe Asp Phe Ser Cys Glu Leu Tyr Arg Met Ser Thr Tyr
20 25 30

Ser Ala Phe Pro Arg Gly Val Pro Val Ser Glu Arg Ser Leu Ala Arg
35 40 45

Ala Gly Phe Tyr Tyr Thr Gly Val Asn Asp Lys Val Lys Cys Phe Cys
50 55 60

Cys Gly Leu Met Leu Asp Asn Trp Lys Gln Gly Asp Ser Pro Val Glu
65 70 75 80

Lys His Arg Gln Phe Tyr Pro Ser Cys Ser Phe Val Gln Thr Leu Leu
85 90 95

Ser Ala Ser Leu Gln Ser Pro Ser Lys Asn Met Ser Pro Val Lys Ser
100 105 110

Arg Phe Ala His Ser Ser Pro Leu Glu Arg Gly Gly Ile His Ser Asn
115 120 125

Leu Cys Ser Ser Pro Leu Asn Ser Arg Ala Val Glu Asp Phe Ser Ser
130 135 140

Arg Met Asp Pro Cys Ser Tyr Ala Met Ser Thr Glu Glu Ala Arg Phe
145 150 155 160

Leu Thr Tyr Ser Met Trp Pro Leu Ser Phe Leu Ser Pro Ala Glu Leu
165 170 175

Ala Arg Ala Gly Phe Tyr Tyr Ile Gly Pro Gly Asp Arg Val Ala Cys
180 185 190

Phe Ala Cys Gly Gly Lys Leu Ser Asn Trp Glu Pro Lys Asp Tyr Ala
195 200 205

Met Ser Glu His Arg Arg His Phe Pro His Cys Pro Phe Leu Glu Asn
210 215 220

Thr Ser Glu Thr Gln Arg Phe Ser Ile Ser Asn Leu Ser Met Gln Thr
225 230 235 240

His Ser Ala Arg Leu Arg Thr Phe Leu Tyr Trp Pro Pro Ser Val Pro
245 250 255

Val Gln Pro Glu Gln Leu Ala Ser Ala Gly Phe Tyr Tyr Val Asp Arg
260 265 270

Asn Asp Asp Val Lys Cys Leu Cys Cys Asp Gly Gly Leu Arg Cys Trp
275 280 285

Glu Pro Gly Asp Asp Pro Trp Ile Glu His Ala Lys Trp Phe Pro Arg
290 295 300

Cys Glu Phe Leu Ile Arg Met Lys Gly Gln Glu Phe Val Asp Glu Ile
305 310 315 320

Gln Ala Arg Tyr Pro His Leu Leu Glu Gln Leu Leu Ser Thr Ser Asp
325 330 335

Thr Pro Gly Glu Glu Asn Ala Asp Pro Thr Glu Thr Val Val His Phe
340 345 350

Gly Pro Gly Glu Ser Ser Lys Asp Val Val Met Met Ser Thr Pro Val
355 360 365

Val Lys Ala Ala Leu Glu Met Gly Phe Ser Arg Ser Leu Val Arg Gln
370 375 380

Thr Val Gln Arg Gln Ile Leu Ala Thr Gly Glu Asn Tyr Arg Thr Val
385 390 395 400

Asn Asp Ile Val Ser Val Leu Leu Asn Ala Glu Asp Glu Arg Arg Glu
405 410 415

Glu Glu Lys Glu Arg Gln Thr Glu Glu Met Ala Ser Gly Asp Leu Ser
420 425 430

Leu Ile Arg Lys Asn Arg Met Ala Leu Phe Gln Gln Leu Thr His Val
435 440 445

Leu Pro Ile Leu Asp Asn Leu Leu Glu Ala Ser Val Ile Thr Lys Gln
450 455 460

Glu His Asp Ile Ile Arg Gln Lys Thr Gln Ile Pro Leu Gln Ala Arg
465 470 475 480

Glu Leu Ile Asp Thr Val Leu Val Lys Gly Asn Ala Ala Ala Asn Ile
485 490 495

Phe Lys Asn Ser Leu Lys Gly Ile Asp Ser Thr Leu Tyr Glu Asn Leu
500 505 510

Phe Val Glu Lys Asn Met Lys Tyr Ile Pro Thr Glu Asp Val Ser Gly
515 520 525

Leu Ser Leu Glu Glu Gln Leu Arg Arg Leu Gln Glu Glu Arg Thr Cys
530 535 540

Lys Val Cys Met Asp Arg Glu Val Ser Ile Val Phe Ile Pro Cys Gly
545 550 555 560

His Leu Val Val Cys Gln Glu Cys Ala Pro Ser Leu Arg Lys Cys Pro
565 570 575

Ile Cys Arg Gly Thr Ile Lys Gly Thr Val Arg Thr Phe Leu Ser
580 585 590